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I. BACKGROUND 


A. INTRODUCTION 

Interest in storage reclamation methods, in particular 
ExubscaEcoJlection. has increased dramatically the past 
Es rs Storage reclamation is the process of re- 
wne discarded information and returning the memory space 
occupied by that discarded information to an available area 
Est in memory that is available to be utilized by pro- 
wers. This increased interest in storage reclamation is 
Ae Su lt Of the increasing use of list-processing environ- 
ments. A list-processing environment is a system in which 
the language manipulates data structures called "lists." In 
DNS words, in a Yrst-processzng environment, the information 
to be reclaimed is in the form of list cells. 

One of the better known and more popular languages that 
memomlates lists is LISP, which is designed to facilitate 
peocramming of complex symbolic processes. - It eases this 
burden by providing automatic storage allocation and reclamation 
NEL: p. 522]. 

Ine tie periodi of time that storage reclamation is 
taking place, program execution comes to a halt. While this 
me A anod may appear to be insignificant to the programmer 
(say in terms of seconds), it can become frustrating and be- 


comes increasingly larger as the number of programmers on the 


system increases, resulting in longer "wait' times for each 
user. Therefore, if it were possible to conduct storage 
reclamation without having to Stop program execution i 
would allow program execution to continue at the same time 
that the necessary storage reclamation operations are taking 
pee ee. 

Experience with large LISP programs indicates that LUN 
30$ of their execution time is spent in garbage collec 
[Ref. 2: p. 341]. This paper investigates the feasibility of 
being able to perform garbage collection concurrently wires 


program execution nm Pp. 


BSOS 

As programs that utilize extensive list-processing bes 
come more and more common and as they continue to grow and 
become more complex, it becomes essential that a method for 
efficient and real-time storage reclamation be used. Delays 
due to storage reclamation are a nuisance to programmers and 
researchers but they could be of critical importance in seme 
applications. For example, a natural language interface to 
an emergency medical database, that is written in a list- 
processing language such as LISP, could be considered untrust- 
worthy 1f garbage collection caused lengthy delays [Ref. 3: 
p. 1143]. A method for returning discarded information to 
memory (to a structure called the "free list") is an esse 


ingredient of any list-processing system. This free list, a 
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space in memory that is available to be utilized by user 
Droprams o comtbaias all cells that are not being used by 
oro (se. cells are system representations of memory 
Saco) nai tially, the free list contains all storage not 
pica the programs; cells Ire removed from it and 

formed into list structures as the programs are executed 

DN SOI lhe term “working™list" 1s the set or list 
of cells that is currently being utilized by a user's program. 

There are several storage reclamation methods. Each 
wod is described in the following chapter. One of the 
Guessucal methods, and the one that most LISP implementations 
use for reclaiming unused memory space, is called garbage 
commtec tion. Basically it is the operation of first marking 
ali cells in memory reachable from the main program and then 
mieapane or returning all unmarked cells to an available free 
list [Ref. 5: p. 491]. Garbage collection is usually invoked 
only after the main program has run out of memory. The time 
when garbage collection is invoked depends upon the implementa- 
tion of the storage reclamation method on the computing system 
in question. 

A requirement of any storage reclamation method, including 
garbage collection, is to return cells back to memory, or the 
free list, only when that cell is no longer needed by the 
program (i.e. it is discarded and no longer accessible from 
the program). Storage reclamation causes program interruption 


normally when the program has no more memory to use (i.e. no 
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more cells in the free List), and@ tus puoi © xceunmmen 


1s suspended until the storage reclaimer has finished. 


C. LISP LANGUAGE 

It is assumed that the reader has a fundamental grasp 
and understanding of LISP. LISP was selected as the language 
to investigate the feasibility of CONCURRENT storage reclama- 
tion because it is used for highly interactive programming. 
The language has several properties, including program/data 
equivalence, that enable a certain style of programming to 
develop that is characterized by powerful interactive sup- 
port for programmers, non-standard program structures and 
non-standard program development methods [Ref. 0: D IF 

A LISP list is really nothing more than a linear list of 
elements called cells. These cells have fields and may mE 
tain pointers to other lists. The list is a "finite seges 
of zero or more cells or other lists" [Ref )) 7: p. 400 
cell can be thought of as one or more continuous computer 
words, representing memory space, that can be made available 
to a user [Ref. 8: p. 534]. These cells are requested from 
the free list by the user's program. Because there is a 
finite number of available cells, there may come a time that 
there are no more cells still in memory for the program e 
use. When this happens, LISP uses an automatic methods 
reclaiming discarded cells of themuser seprocram. lee 


method is called garbage collection or "regular" garbage 


152 


collection. A cell becomes discarded (commonly referred to as 
"scarbage") when it can no longer be pointed at or accessed 
through the pointer fields of any reachable or accessible cell. 
AS Meet to the garbage colMedtor*to reclaim this 
Ios and return these cells to the free list (Ref. 2: 
foc) thee garbage collector cam be implemented using 
hardware methods, software methods, or a combination of both 
hardware and software. 

A cell is called "accessible" if it is reachable from at 
least one root via any directed path. LISP terminology will 
be used throughout the remainder of this paper and the words . 
"cell" and "node" will be used interchangeably. 

Edemmemode Mas a separate idémtity, which means that it 
can be identified independently of the structure of any di- 
Memtedesraph. Finding this node from a "root" and finding 
Se Oon right successors ok this node are called "primitive 
operations." me process of Mocating or finding the pre- 
decessors of this node involves a search through the entire 
list of nodes; ier through the entire area of memory allocated 
mme users programs. Wis is the reason that identifying 
garbage is not such a simple or easy task. The task is dele- 
gated to the garbage collector, which maintains the free list. 
Again, the free list is just a collection of nodes that have 
becin bied as: "garbage" amd are available to a user's 


program. 
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LISP is a high level language and as such contains 
primitives that automatically call the garbage collection 
LISP the function cons triggers the garbage collector witem 
no more free cells are available. 

LISP functions are called S expressions (wiere "Sn 
symbolic). The basic elements of an S-expression are Calas 
atoms. These expressions are surrounded by parenthesis and 
these parenthesis must balance in a meaningful way. For ex- 
amplewwthe Expression: 

(add 8 9)) (add(add 2 1) 

is not a proper S-expression. However, the following enpm EEE 
¡on is a properly written S-P- STON: 

(times (difference 8 6) (sum 3 1)). 

In the latter expression the proper answer is: 8$. 

Although it is awkward to use functional notation for 
Simple arithmetic, one of the good features of LISP Touma 
everything in the language can be expressed as a func uei 
[Ref. 9: p. 140]. These functions are Tepresented Da 
basic data structure of LISP, which is the list. When the 
components or cells of a list GonGain = pommters =n emo 
cell fields, then the list data structure topology resembles 
a tree. The head of this tree is called the root and the 
elements of the tree are called nodes. For example, in DENUES 
1.1 the S-expression used above is expressed in a tree-like 


structurer 
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times 


difference sum 


as 





Figure 1.1 S-expression Tree. 


cmo AMS? functions are car and cdr, which respect- 
Nate ana provide the first element in a list and the 
remaining elements in the list (if there are any) [Ref. 9: 
AN. Another feature of LISP which is invaluable in an 
Mieeractive System is that LISP can manipulate symbols as well 
as numbers. For example, the function (SETQ K 95) serves a 
due purpose in that it nl cestas an ass iSimentistate- 
MS DUt tt also gives the variable "K" the value of 95. 
Miereas thefeunctrion (SET K E) equates the variable that is 
Male CO FK to the value of "E" and takes-on whatever 
ume that "E" has. 

ery atom im Lise has a value. Numbers or literals are 
atoms whose values are themselves, Bor the value of an atom 
also can be another atom with a symbolic name.  NIL is the 
mE uStor that indicates the end of a list. 

Mé next chapter provides a look at the various methods 


otto rage reclamation. 
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II. STORAGE RECDAMSBIONSPEEGHBNMSUES 


There are two basic methods for Storage Reclamation: 


(1) Manual -- 
(a) the responsibility for reclamation lies with tie 
programmer. 
(2) Automatic -- Eliminates programmer responsibility Tammi 


includes the following two subtypes: 
(a) Reference counts 
The list manipulation primitives maintain a Te temas 
count for each cell that indicates the number of other 
cells whücehopousnt togmt LRef. lUo DM 
(b) and garbage collection. 
Each of these methods is discussed further in the subse- 
quent sections of this chapter. 
A. PROGRAMMER RESPONSIBILITY STORAGE RECLAMATION 
The first method of storage reclamation is the manual or 
programmer responsibility method. There are several languages 
that provide the capability for the programmer to allocate 
Storage and deallocate storage (i.e. storage reclamation). 
IPL-V includes instructions that cause lists and list stp 
to be erased and their cells to be returned to the free list 
[Ref. 4: p. 501]. These instructions are for use by themes 
grammer; what this means is that the programmer has to keep 
track of the current status of all lists, sublists, and Gian 
data structures. One immediately recognized disadvantage of 


this method is that, in addition to maintaining a tablewom 
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uL cot 50sta structure, the programmer, when "de- 
EE GUN oaee. may accidentally erase a- cell (or even 
cis that are being shared with other lists: l.e. 
EI I d uby cells in other lists. These other cells 
eee required in the user's program and be accessible 
rom) to the user's program. This will result in the "dangling 
reference" problem, which is discussed later in this chapter. 

ENSE unouacses. such as Pascal, use "explicit erasure," 
which means that whenever a running program no longer re- 
Mes a particular cell, then the programmer himself must 
ic ly return at to the free storage area [Ref. 11: p. 
440]. Pascal uses two procedures to accomplish this task: 

(1) The "dispose" procedure returns to the free storage 

area whatever cell was pointed to by the argument of 
"dispose." This is done by linking the discarded 

cell onto the free list. Later, the storage allocator 
AA coreuse this cell from the free list to 
satisfy a request from a user's program. 

(2) The "new" procedure obtains an available cell from the 

free list to satisfy a request for memory space from 
a user's program. 

There are several other inherent disadvantages in this 
method of storage reclamation in which the programmer 
laers the burden of allocating and deallocating memory 
spaces. The programmer has to work harder. He has to re- 
member additional items, such as the current status of each 
pM Uhether that cell is still active (i.e. still re- 


quired for his program) or if the cell is non-active (garbage). 


ME e ng system, this also means that he has to 
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remain aware of which lists, sub@rsts, cells To 
etc. are active and which aré non-active. hrs 1s a Taree 
task and one that few programmers can accomplish successfully 
and accurately, especially as the size and complexity ormie 
user's program increases. The programmer's main task should 
be to concentrate on the more important issues-at-hand, such 
as program structure and organization. 

As mentioned earlier, there may be times when a cell will 
be returned to the free list but that cell is still being 
pointed at by other cells in the same list or possibly aL 
list. In either situation, this results in what has been 
called the dangling rererence problem: pe 2.1 111USt mae 
a simple example of this problem. 

The dangling reference problem occurs when a cell is 
classified as "garbage" and returned to the free list but 
the cell still has pointers from allocated cells in the 
user's program. Why is this such a problem? In Figure mM 
cells H, B, C, and E have been designated as "garbage" and 
are returned to the free list. When this happens, cells 
D, and F are said to have dangling references because they 
reference (i.e. have pointers to) non-existent cells. Now 
the problem is that whenever the storage allocator reuse 
cell B for the user's program, these dangling references will 
most likely cause undesired and unpredictable side effects 


(if pointers are not reinitialized). 
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User's 
Program 





Figure 2.1 Dangling Reference Example. 

Some LISP processors provide a feature allowing the user 
Amnn oke the storage collector. Normally, the storage re- 
A ton method used is a type of garbage collection. This 
Ca be a veny helpful feature»if the programmer has a good 
idea of when would be the best time to invoke the collector. 

In computing systems where the user has the capability to 
MM the garbage collector, the programmer is concerned 
a there is sufficient memory for his program to run without 
"crashing.' He does this by knowing how much memory his pro- 
gram has used and how much more memory is still available 
A an as As a difficult assignment in an inter- 


active system where the number of users fluctuates and each 
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user is trying to figure out the memory requirements for his 
program. Forgetting just one pointer to a cell that has bem@ 
sent to the free list will result in the dangling reference 
problem. In LISP systems where the programmer determines 
allocation and deallocation, the function “return” can “aa 
cause this problem. The only way that a programmer can 
stop this problem from occurring is to take extreme camis 
NOT to reuse any cell that he may think has a pointer to m 
Again, this is just another thing for the programmer aue 
ic cnl ee 

Because of the dangling reference problem and the additional 
programmer burden, the MANUAL, reclamation method in LISPESN 
any other language that has such features is NOT considered 


an adequate selection for concurrent storage reclamation 


B. REFERENCE COUNT STORAGE RECLAMATION 

The second method of automatic storage reclamation is 
called the “Tretewemee count meda This means that We 
programmer "Ss responsibility for Lots toma allocation and 
deallocation is eliminated and is automatically taken care 
of by the system. The reference count method of storage 
reclamation requires an additional field in each cell, which 
contains a count of how many other cells point to it. MAMMA 
count is a positive integer number, with a lower limit om 
zero. Whenever the numerical count in the cell's field be- 
comes zero, then it is available to be used for users AMA 


grams. Cells are returned to the free list as soon as 


zu 


possible after they become 


Maa Sib le: consequently, 


oer page’ COMAS that are inaccessible and unavailable to be 


reused again will NEVER accumulate [Ref. 11: p. 443]. 


Do not confuse available with accessible! In LISP, a 


Seilieiseteeessible if it is pointed at or referenced by 


other accessible cells. The only cells that are directly 


accessible are those cells 


interpreter, such as those 


that are used by the system's 


cells that contain a user's 


Buena that 1S being interpreted and its corresponding 


meeetatemon lists that represent the environments still in 


use [Ref. 11: p. 441]. A cell is "available" if it is on 


me free list, ready and available to be utilized by user 


programs. 


Reference Counts maintain track of the number of access- 


ible references to each cell. Figure 2.2 illustrates an ex- 


we of a Simple LISP structure that contains cells with 


three fields each: the reference count field, and the left 


and right pointer fields. 
In the reference count 

to the free list, it means 

field became zero and thus 


1s an additional operation 


method whenever a cell is returned 
that the cell's reference count 
the cell is inaccessible. There 


required in reference counting in 


MUN SEever cells were pointed at by the cell that has been 


termed "garbage" and returned to the free list must also have 


their reference count fields decremented by one. This is be- 


cause there is one less accessible cell reference to each. 


¿l 


This decrement operation may in turn cause any of these cell's 
reference count fields to also become zero and thus become 
inaccessible and available for return to the free list. In 
other words, decrementing a reference count could be a re- 


cursive operation [Ref. IEEE D 


à I Hy 


Figure 2.2 Reference Count Example. 





, 
a a A 
a mc a TÉ a amg em ms 


There is a modification of the reference count technmE 
suggested by Weizenbaum, that uses "doubly linked" list 
structures. In this method, a reference count is placec n 
in the header of each list (this means that there are no 
reference count fields in individual cells). Doubly linea 


lists were examined because they provide more efficient 


ae 


A trans versing back and forth at will in 
examining doubly linked lists) than singly linked lists. 
MO metiod. the programmer plays a more active role 
MADO MeNe tocation and deallocation of memory. The pro- 
wen ASt remember the rules for maintaining reference 
ewes tor a list of cells so that he can avoid performing 
operations that may refer to a particular list whose 
reference count has reached zero. Additionally, the pro- 
er ican explicitly override the reference count and re- 
a Spec list to the free list even before the reference 
count becomes zero. These are features that have to be 
utilized with caution because, in the hands of inexperienced 
M ammers, they Can cause side effects that are neither 
wanted nor constructive (e.g. dangling references). Weizenbaum's 
proach is excellent for the handling of list structures: 
whenever a list's reference count has become zero, then it is 
End x the end of the free list. [Ref. 7: . 412]. 
However, the time required to find the head may be ex- 
travagant and not worth the unnecessary delay. Besides being 
mme consuming and increasing the programmer's burden, this 
Pererence count technique may prevent part of the list from 
ExcErcrunnedotrocthe free list. Why? A part of this list 
Tien: be required by other lists in the user's program 
Witlesthie remaining part of the list is "garbage"; the prob- 
Bum xh tae part of the list that is still required, 


because this list must now be treated as a separate list with 
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a new reference counter [Ref. 4: p. 501]. These are operas 
tions that are normally not provided in the doubly linked 
pambuacceseotiecomongm OE 
In LISP, as in.other languages, reference counts must 
be maintained accurately and the status of all cells' reference 
counts being kept current. Whenever an additional reference 
to an accessible cell is made, that cell's reference count 
field must be increased by the number of additional references 
pointing to the cell. Similarly, whenever a reference to a 
particular cell is removed then that cell's referenco commis 
field must be decremented by the number of references removed. 
In LISP, there are two ways that a cell can be destroyed: 
(1) A pointer can be overwritten by using an assignment 
operation, or by using the functions rplaca or rp 
(Ref Mr AS 


(2) and the cell containing the pointer itself can become 
inaccessible [Ref. 11: p. 441]. 


In Figure 2.2 (reference count example), there is AMAS 
ibility that no cells could be returned to the free 11S tiem 
if the cells were all inaccessible: -This can happen w mom 
there 1s a cyclefsomennen emim ai S Ure, WA 
means that there is a path from a cell back to itself. D 
reference count system, cyclic data Structumes are NUM 
claimed, and these cells will be lost forever [Ref. 11: p NM 

Reference counts provide a different approach to the 
problem of storage reclamation than the programmer responsi- 


bility technique. While it removes the programmer's buat 
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"ccc censu additional problems. The reference count 
method does not work at all in the case of a circular list, 
EE DM: ubl:ist of itself. Figure 2.3 illustrates 
a Or a Circular list structure. Reference counts 

do not work because the reference count field can never be 
reduced to zero, even when the entire list becomes inaccessible 


Eo dp. 501]. 





A Aa a oe t en 


ÓN 
7 "I 


Panes Circular List Structure. 
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In Pegure 2.3 (cireular list structure) each" coma 


again the same three fields as earlier discussed. This 


structure was created by using the rpllacd tunet ron wites. 


the right pointer of cell D points to cell A. There 15 90m 


and only one accessible path amd that js from the pointes 


that points to cell A from the left direction. 


There are other disadvantages in the reference coume 


method. These are: 


(1) 


(2) 


(3) 


(4) 


It requires an additional field in each cell to som 

as a counter. In small cells this may cost 25% or 

more extra memory space [Ref. 10: p. 495]. Theorem 
ly, the reference count field in each cell must bes 
enough to handle the maximum number of cells that are 

in memory. 


The basic list preeessing primitives which creates 
jects and copy pointers must spend their time updating 
the cells' reference count fields [Ref. 10: p. AM 
This 1s penmsesve ovembead- 


Reference counting does not always free all the Coma 
that are available. Circular lists (i.e. lists that 
refer back to themselves) will NEVER have a reference 
count that will reach zero. This holds true even when 
no other list that is accessible to a "running MON 
gram points to them [Ref. 7: p. 412]. 


Reference counting is unacceptable to use as a memory 
management scheme because there are "unbounded" de- 

lays whenever a cell is returned to the free list CEE 
occurs because all successors of the returned cell 
become "garbage" and should be returned to the free list 
at the same time when that original cell is being re- 
turned [Ref. 12: p. 112]. This again is a vast Ona 
head to pay and may result in non-uniform execution 
emmes: 


Measurements of actual LISP programs show that about 97% 


of all list cells have just one reference to them [Rei 


351]. 


But because of its inability to handle cyclic 


Zo 


structures, reference counting is NOT considered an adequate 


method for concurrent storage reclamation. 


C. GARBAGE COLLECTION STORAGE RECLAMATION 

iiemeitcasamd last method of storage reclamation is 
Maoaces collection” and like reference counting is an auto- 
matic reclamation method. As previously stated, garbage 
Ae ca 1s .tne process of reclaiming unused storage space. 
It is an automatic storage reclamation method that can handle 
Ae data Structures [Ref. 11: p. 443]. The basic 
foo omcoOolmlecct10n method requires an additional field in 
each cell, like that required for the reference count method. 
However, unlike the reference count method, the additional 
field is only a one-bit field. This one-bit is called the 
MEE” bit. The general idea of garbage collection is that 
a program continues to run without returning any cells to 
meree list until no more storage is available (this is 
called REGULAR garbage collection). When this happens, the 
prosa malts and a "recycling" algorithm uses the mark bits 
ton tirst determine or mark which cells 'are "garbage" and then 
E --pscturn to tie free list) all inaccessible or "gar- 
|a cells. Mimbomenedtes Memory space through the reuse of 
peer eteotelyestised cells to allow continuation of the user's 
program. 

Regular garbage collection postpones the problem of 


storage reclamation until the free list cells is exhausted 


"mi 


[Ref. 10: p. 495]. When this occurs. the user's moc maine 
temporarily halted during which @ garbage collector Toute 
determines which cells are no longer accessible to the 
user's program and returns there LT cc 
where they can be reused again by the user's program. A 
simple way of viewing garbage collection is that storage 
reclamation is NOT a problem until there is no more memory 
for users! programs to utilize; and then with all available 
memory exhausted, the garbage collector becomes involved. 
Garbage collection, unlike reference counts, does reclaim 
cyclic structures. Normally, the sarbage collector 159 
eee routine, relatively disjoint from the rest CONS 
list-processing system [Ref. 10: p. 496]. 

Although garbage collection has several advantages, there 
are some problems with using garbage collection in a list- 
processing environment. The garbage collector has to SMN 
all of memory in order to identify "garbage" cells and non - 
garbage cells. This scanning requires signitiicant processu 
time. With LISP programs spending 10% to 30% of their time 
doing garbage collection, it is NOT unusual for a large qM 
program to take 5-6 seconds to perform garbape collect 
[Ref. 5: p. 491]. If these run times were to be multiplied 
by a factor of 2-5 (or more) in order to obtain the corre. moma 
ing "real" run times in a time-shared and interactive SSA 
it can be seen that performing garbage collection can TEAM 


delays that are inconvenient and excessive to users M 
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interactive system; and as programs continue to grow and 
become more complex and as memory sizes continue to expand, 
HEUS c casbape collection may reach the state of 
becoming intolerable. 

Just as the two previous storage reclamation methods 
had their disadvantages, there are some disadvantages in 
aanzie garbage collection. The most obvious one is that 
ansadditironal '"marking" bit is required in each cell. 
ona lily, garbage collection traditionally runs very 
slow when all of memory is in use. This is because the 
garbage collector must scan the entire memory area that is 
Occupied by the user's program in order to identify "garbage" 
cells. In some instances, the number of reclaimed cells 
Mia re returned to the free list may not be worth the 
Ert [Ref. 7: p. 412]. .The unpredictability of when 
mecbaece collection occurs is a difficult design concept in a 
meal-time system. In other words, when does garbage collection 
nem It may not occur and normally does not occur at the 
Me me. every time xt is inyoked. Delaying program execution 
is another disadvantage, and combined with the unpredictability 
m garbage collection can be frustrating and dangerous. This 
results from the fact that the garbage collector is relatively 
tie rest of the list- processing functions [Ref. 
LU: pee], 

As list-processing databases continue to grow, garbage 


collection problems will also continue to surface more 
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frequently and become more noticeable to programmers (i.e. 
garbage collection will take longer and longer to complete). 
Depending on the size of memory, the delay caused by gar- 
bage collection is proportional to the amount of memory being 
usede. I: p. 522E 

However even with the above disadvantages, garbage col- 
lection is a worthwhile and necessary activity, especially 
in a LISP environment. Garbage collection has always Deen 
needed because the amount of addressable space in memory 
has always been much less than the total space both rcs 
quired and used during execution of a list-processing pH 
gram. Thus, garbage coflectton enables a "reusing" of eiii 
system's finite amount of addressable memory. Additionally; 
garbage collection reclaims cyclic or circular data sto e 
Garbage collection frees not only the programmer but also 
most of the list-processing primitives and functions trem 
the concern about storage reclamation; now only those primi- 
tives that create new cells from the free list can invoke 
the garbage collector [Ref. 10: p. 495]. In LISP, the meo 
tion cons requests new cells from the free Tist [Ref MM 
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D. CONCLUSION 
The major problems that arises when attempting to reclaim 
a part of a list structure (i.e. an individual cell ord MEME 


of cells) is knowing which part of the structure is "garbage" 
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Ms still required for the user's program 
ee EEN of the above three methods of 
storage reclamation are responsible for this reclamation. 

Of the three methods, the remainder of this paper is de- 
voted to garbage collection. Garbage collection removes 
the responsibility of reclaiming "garbage" from the program- 
mer and allows him to concentrate on his program. Garbage 
eec eilon also reclaims circular data structures which 
aneen e counts could not handle. Circular recursive struc- 
tures are common features in a list-processing system, such 
ao LISP. 

Additionally, it Nas been convenient to classify garbage 
mec eon according to the size of the cells that are used 
eeehe users! program and reclaimed by the garbage collector 
Meet. 2: p. 343]. LISP cells illustrate the problems that 
are involved in marking and sweeping '"single-size" cells or 
cells that are of the same size (i.e. all cells have the same 
number of fields and each field has the same number of bits 
per cell). This paper will discuss the LISP garbage collection 
MEMOS. CONnscaqucntly, Single-sized cells are assumed to be 
aoe. norm." 

The next Chapter Examines the basic process and techniques 


used in garbage collection. 


al 


III. GARBAGE COL GUON PROCESS 


As discussed in the previous chapter, garbage collectam 
is a method of reclaiming unused areas of the memory that 
are being used and allotted to the users' programs. Garbage 
collection will be referred to as "GC" throughout the rem 
der OL this paper. 

There are two basic phases that constitute GC. These 
have been termed the "marking “and the "sweepumng  phasssi 
Marking is the process of transiting through memory and 
identifying any cells that may be reclaimed. Sweeping 1s the 
process of incorporating these cells into the memory aire 
that is available to the user. This memory area is cam 


the free list. 


A. PHASE ONE: MARKING 

This phase is usually performed with the garbage collector 
maintaining a list of immediately accessible cells; when the 
garbage collector is activated, the links (or pointers) Smi 
connect one cell to another are traced (followed) and every 
accessible cell is then marked. 

In the marking phase, all mark bits are assumed set to 
zero (usually by the preceding sweep phase). The. phase OS 
from the beginning of memory and from these cells tha tem 
accessible to the user's program. Cells that are no lO eM 


required by the program dre called an To save 
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time, a list of immediately accessible cells is maintained 
urhem base collector simply traces the links of those 
EEUNS ure con this list. This ensures that every 
single accessible cell is marked and unmarked cells are 
Maus ready to be reclaimed. 

For illustration purposes, each LISP cell has two fields 
eonan pointers to other cells. Other LISP implementa- 
ee ma have other cell formats. The left cell field can 
m aS ing the function car and the right cell field 
DIS owd by using the function cdr. Each cell addition- 
as tro boolean (one-bit) fields; one that is used to 
differentiate between atomic and non-atomic cells and the 
Omer field is used for marking [Ref. 2: p. 3435]. Figure 


ix etrates sugh a LISP cell. 
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S(sign): “mark” bit used in GC 
0 garbage 


1 non-garbage 
T(type): | 
= Q indicates atomic 
T = J] indicates non-atomic 


Po  polmLers to other cells/lists or NIL 


Note: us word of data associated with this cell (by 
pointers) when T = 0 


IM McCESU  Lioe Cell Format. 
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The "marking" algorithm is recursive and is able tones 
claim circular list Structures oW Buteobevng?recurstvemo 
means that memory space is required (i.e. a stack) in order 
to place and thus save the accessible cells. This sounds 
like a "catch-22" in that the garbage collector was orm 
ly called because there was no more memory area for the 
. users! programs. How then can there be memory available to 
handle the recursive marking algorithm when the garbage 
collector was called? This situation becomes a problem De- 
cause the marking algorithm is operating entirely in mage 
memory and not secondary storage. To solve this problem, 
several algorithms have been proposed. All these algorithms 
reduce the required storage by trading it for longer time 
needed to perform the marking phase (whereas most recurs 
marking algorithms reserve some memory space for a fI 
length stack, which is used for the marking).  [Ref. 2: p. 
Sa 

One algorithm is the Deutsch-Schorr-Waite algorithm, in 
which the cells of a list structure are traced and inspected 
without having to use a stack. This algorithm reverses suc- 
cessive links until either the leaves (i.e. atoms) in the 
StruGHuTe yor Celts that have already been visited are found. 
The link reversal is then undone by reconstructing the CAI 
list structure. Al cells are vost ted mec fines. ME 
additional visit and the overhead to restore all pointers and 


for inspecting and marking the bits render this method less 
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Es newEuMenadlly, this algorithm requires the use of 
ameener Odes mela in cach cell. This bit is called the "tag 
ux Ucstesothe direction 3m which thejrestoration of 
the reversed links should proceed (i.e. whether to follow the 
Eu cor hisporter freld in each cell) [Ref. 2: p. 544]. 

Another marking algorithm that solves the above problem 
is that proposed by Kurokawa. His algorithm uses a fixed- 
Picti staek and a tag bit (similar to Deutsch-Schorr-Waite's 
ADA Wen thé stack overflows, some of the pointers 
from the stack are deletedbut the information is preserved 
by turning on the tag bit of the unstacked cells: these cells 
form a chain (the pointer to this chain is left on the stack). 
Removal of stack cells makes space available to continue the 
marking phase. Whenever a pointer is removed from the stack, 
Me examined to determine whether the cell it points to is 
tagged. If it is, then the linked tagged cells are retraced 
Eer. 2p. “345]. 

Figure 3.2 illustrates the marking phase of garbage col- 
lection. The dashed line reflects the sequential trace 
iRuouvchethe cells in wemory. For additional information, see 
[Ref. 11]. 

By the end of the marking phase, ALL accessible cells have 
been marked and are ready to be reclaimed. The mark phase 
basically determines which cells remain accessible to the 


users' programs and which cells are not needed by the user's 


SO 
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Figure 3.2 aMarkineS Phase pee 
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program and thus are "garbage." This reclamation is the 


AS oie second phase of garbage collection: sweeping. 


B. PHASE TWO: SWEEPING 

The marking phase traced and identified ALL non-garbage 
EUN hanne started from those cells that were im- 
mediately accessible to the user's program. The sweeping 
phase now makes a sequential pass in memory of all the cells 
that were traced by the marking phase and incorporates ALL 
Unmarked cells onto the free list in memory. In other words, 
ALL unmarked cells are concluded to be "garbage" and are 
appended to the free list, in which available cells are 
linked by pointers [Ref. 11: p. 445]. 

The simplest method for reclaiming the marked cells 
Amas tes Ot linearly sweeping through memory. Each cell is 
Visited in order (i.e. sweeping through memory). If a cell 
E unuarked then it is inaccessible to the user and can now 
be linked onto the free list. However, if the cell is marked, 
Ms accessible and its mark bit is reset back to zero 
Mire paration tor the next marking phase of garbage collection. 
The garbage collector moves on to the next cell in order, re- 
peating this process until there are no more cells to "sweep." 
nus, there are two sequential passes through memory: one for 
the marking phase and the other for the sweeping phase. 


The sweeping phase is usually separated into 2 distinct 


subphases. These are: 
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(1) Incorporation of all avaieiabile E linked 
or connected to other celllgeby pointers into aimee 
I co 

(Z) Compaction, which will Dewexplained later, is) whem 
all unused cells are "moved" to One end of memory 
while the other end of memory contains those cells 
that are currently accessible and being itii iien 
users' programs. 

The incorporation of cells being returned (sweeping 
phase) to the free list can be subdivided into specific d BENE 
tion areas, depending on whether compaction is required om 
not. If compaction is not required, then the sweep phase 
consists of removing all cells identified as garbage and 
placing them anywhere on the free list. However, if comp 
tion is used then ALL available cells are compacted INTO 
contiguous area in memory; compaction also necessitates seme 
updating of all pointer references to the cells that have 
been relocated. Compaction prevents "thrashing" and performs 


better (significant time gains) than a non-compacting garbage 


collector. This paper will be concerned with garbage com MD 


with compac tiron 


C. COMPACTION 

As mentioned earlier, when compaction is considered a de- 
sirable feature of the garbage collector, memory is base 
divided into two areas; compaction moves all available COMA 
to one end of memory (containing the free list) and the other 
end of memory contains accessible cells that are being Wee 
by users" programs. There are several areas of concerm NM 
“compacting” GC scheme has tober oncer a SN 
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Io wrexactly where each cell is to be moved to. 


Ces 1n use must be adjusted to the "planned" 
address of the cells that they reference. 


cells in use must be moved to its "planned" 
address [Ref. 13: p. 204]. 


Compacting garbage collectors are more complicated than 
Om Copa ting garbage collectors. Most garbage collectors 
Ro unu nseonpadetuang [Ref. 135: p. 204], because of the 
M aS ed Overhead in using "compaction." Compaction is 
EIU Opt on 1n GC. It can be invoked only when needed. 
Meenougmespeed Of the garbage collector with compaction is 
not crucial, it must be efficient and should make a minimum 
nacusNdsCcon the storage area itself. If compaction is an 
option and is not invoked a great many times, why then should 
Lc concerned with compaction in the first place? 

iets anew ane simpl est version (no compaction), GC re- 
As unused oradrscarded cellis whzle leaving the areas where 
Me cells are located exactly where they are physically lo- 
atea memory. Eventually, a snapshot of memory after GC 
EE ccurred wjll look like a piece of swiss cheese in that 
the memory area will be fragmented. Consequently, there may 
PememeSltuatron where there is no single space or area in 
memory large enough to accommodate or satisfy a user's re- 
quest even though the — of available memory space 


1s large enough. Therefore, one solution to this potential 


problem, is compaction. 
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Compacting memory areas basically means that all unused 
areas of memory are moved (compacted) to one end of memory 
and the other end of memory is reserved for the areas that 
are currently being utilized by users' programs. The resis 
list structure has the identical topological structure QNEM 
old^Steructure "Ret 79s p SANE 

Compacting GC has been shown to have significant time 
gains in terms of performance in LISP programs [Ref. 20M 
545]. Even though GC with compaction is more costly Cie 
GC without compaction, its advantage is that it indirecek 
helps to reduce the number of transfers from secondary storage 
in virtual memory systems* In other words, "thrashing 
greatly reduced. 

The decision whether or not to use GC with compacto 
should be based upon the ratio between the total amount of 
computation time and the amount of time the processor spends 
in GC. If the ratio is small, then compaction is unnecess 
[Ref a 

With the garbage collector maintaining the free 117503 
is possible to incorporate the returned garbage cells in one 
contiguous area in memory through compaction. Compacto 
becomes a concern of the programmer ONLY if program execution 
terminates due to insufficient memory, even though garbage 
collection has taken place. 

There are several types of compaction, classified DIS 


relative positions in which cells are left after compacta 


These are: 
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Ma NEellsmrhet origdnally*point to each other 
o not necessarily occupy contiguaus memory positions 
after compaction.  Arbitrary compaction has also been 
MAN o pointer scheme because the garbage 
collector uses two pointers: one pointer sweeps from 
p address and the other pointer sweeps from the 


Deom address in memory. 


When the top pointer reaches 


a garbage cell and the bottom pointer reaches an ac- 
cessible cell, then the accessible cell is moved to 
the garbage cell's address. The process ends when the 
imempoimtersemcet. This has an additional overhead 
involved in the updating of the pointers but it is a 
te technique to describe and implement [Ref. 2: p. 


eT. 


"LInearizing -- Cells that originally point to each 
ferme Navemaapfacent memory positions after Compaction. 


en Sliding -- Celis are moved toward one end of memory 
without altering their linear or relative order. This 
ES MS applicable for heaps of odd size cells; but 
this technique also requires the destruction of occupied 
A O a n Cers to them have beem updated [Ref.' 


CAOS |. 


(4) Copying -- This is a compaction method in which the 
garbage collector creates a second storage area for 
each cell that is to be compacted and then copies these 
cells from the old area to this second area. This 
method is normally reserved for virtual memory systems 
cecon erore etra memeny areas required for copying. 


D. CONCLUSION 

The compaction method that 
Wee oe assumed to be the type 
the remainder of this paper is 


method. Memory is scanned two 


is normally used and one that 
ROCE Compaction technique in 


Phera er Of arbitrary 


mes. ein the first scan, 


two pointers are used as previously discussed. When an ac- 


oo e M M is moved to its new location in memory, its 


ier -biteisoeturmed off to await the next GC cycle. The 


Boece seaimeor memory 15 needed for pointer readjustment: this 
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is because some cells have been moved and now have new 
memory addresses. It is critical to update all pointer Sma 
any obsolete cell locations. If this was not done then Ame 
dangling reference problem can and usually will occur. The 
second scan also scans only the compacted area.  Pointers 
are readjusted whenever they point to cells that have been 
moved to new memory locations. 

For the remainder of this paper, whenever garbage collection 
is mentioned, compaction is assumed to be one of its features: 
This assumption takes into account the fact that in systems 
that utilize single-size cells, garbage collection compaction 
1s not really a prob leémecrceptwone yi Ss, st ene 

Whether or not compaction is used with GC, the same or 
problem of suspension of program execution remains. HOW this 
problem can be eliminated will be discussed in the tol 
chapter, where parallel (concurrent) GC methods and techniques 


are examined. 
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IV. PARALLEL/CONCURRENT GARBAGE COLLECTION 


Parallel garbage collection is one of the more desirable 
musdsDesmorcelumsnating the periodic and unpredictable 
Pension OL program execution in a list processing system; 
E 0olesicableXun an interactive and real-time system 
oe loop. Il. Por example, being able to implement paral- 
NN arbuse collection in LISP will enable simultaneous list 
mausessinoc and execution of user programs, thus eliminating 
paeeuser having to be concerned about his program halting 
Deeause of lack of memory space. The garbage collection 
process will be taking place without the system's users being 
He of it. 

lE I lt fOr an Interactive and real-time list 
Moa essing environment to provide satisfactory service when 
oemat ons must be halted to allow GC to occur. As programs 
Dome larger, as list processing systems become larger and 
MS complex, as list processing databases become larger, and 
Ane oO takes longer and longer to performs this difficulty 


DSSOmes worse and more noticeable to the users [Ref. 16: p. 113]. 


A. INTRODUCTION 

A GC system that ensures users' programs are never sus- 
pended due to lack of available memory has been termed a real- 
time garbage collection system [Ref. 5: p. 491]. This is be- 


cause it provides real-time responses to each user's request. 
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A real-time GC system avoids suspension of list processame 
operations and does NOT halt program execution.  Clarifica- 
tion between real-time garbage collection and regular garbage 
collection is needed to avoid any confusion in readings about 
garbage collection. The only real difference between csl 
two types of garbage collection is WHEN the GC processes 
curs. In "real-time" GC, garbage collection and program 
execution run concurrently (i.e. at the same time); while in 
"regular" GC, the user's program executes until it runs 
of memory when it halts and waits for the GC process to re- 
claim some memory. 

Attention in this paper is devoted to "real-time" garbage 
collection and NOT "regular" garbage collection. The temas 
parallel or concurrent garbage collection will be used MEE 
changeablly threughout thse paper 

In a computing m that has a significantly large 
memory, which is not an uncommon occurrence today, GCSE 
become very expensive in both time and money due to its re- 
quirements to scan and sweep through this entire large memory 
to identify (mark) and return (sweep) all garbage cells. mim 
an interactive LISP system, WHEN garbage collection is AMOS 
occurring becomes very apparent to the user, especially when 
his program halts and the system then informs him that there 
is no more memory available for him to use. This cana 
very frustrating to the users on that system. How then can 


this programmer frustration be eliminated? As previously 
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Stated one rote possible solutions 1s through parallTel 
Gor bage seollection. 

Was paradlle] approach, garbage collection occurs 
Pimleomeeusly Wren program execution. One method to eliminate 
the proolem of the temporarily halting of users’ programs 
(so as to allow GC to occur) would be to construct faster 
hardware (i.e. processors, buses, disk drives, etc.); this 
would allow faster and larger transactions between main 
Memory and secondary store. In other words, one solution is 
a fast system where paging occurs and where the garbage col- 
meeror is speeded up by implementing it in microcode as a 
Paani tiye Operation of the list-processing computer. However, 
this hardware method does have some limitations. While the 
emaent trend of memory sizes continues to increase, it is 
very doubtful that it is possible to increase processor speed 
to the point of eliminating GC time, so that it is unnoticeable 
Ae users Of the system [Ref. 5: p. 492]. 

pie ven a the garoage ceM@lector were to be microprogrammed 
in the hardware of the computa system, GC would sti l Mpe 
necessary; and the GC process may even still necessitate some 
halting of program execution. Additional methods and tech- 
Menes tO allow GC to occur simultaneously with program execu- 
tion are presently available. If the selected GC method for 
Sa ar system (with or without Compaction) is still de- 
Sed tomhavesthe goal of eliminating the halting of user 


programs due to lack of available memory, then the only 
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satisfactory way to ensure that program execution sm 
halted while garbage collection is taking place is through 
parallelism or concurreney er ome 
The efficiency of parallel GC will play a major role in 

the widespread use of list-processing systems [Ref. 5: p. 
1145]. There are several techniques to implement conem 
GC, each of which are subsequently explained. Because the 
focus of this paper is on non-virtual memory systems RE 
Virtual Memory (VM) approach is only briefly described. 
These concurrent GC methods are: 

(1) two processor method 

(2) time slice method 

(3) dynamic processor allocation method 

(4) Virtual Memory (VM) method 
Be TWO" PROCESSORS METHOD 

In this method, two separate and distinct processors are 

used. The first processor, termed the "collector", is re- 
sponsible only for garbage collection. The second processor, 
termed the "mutator", is responsible for program exec 
[Ref. 2: p. 554]. Having two processors available allows 
parallel operations to take place. In other words, this 
lector can be conducting GC at the same time that the mutator 
is performing list processing tasks. The collector operates 
in such a manner that the free list never becomes empty, re- 
sulting in there not being any noticeable delay caused by 
garbage collection [Ref. 5: p. 492]. Both processors "Sia 


a common memory. 
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GeV Tho "Ger tector” -- The collector performs the pre- 
viously described tasks of marking and incorporating 
(eweepunc)eall unmarked cells into the free list. It 
COS re last structures that the mutator has dis- 
Idem 5: p. 49357. While these GC "tasks are 
occurring, the mutator remains active and continues 
its own operations of computation proper. 

Uta tor” -=="The mutator provides all storage re- 
quired by users' programs but the mutator can not 
n accellsoftrom"the ffee"lrst until the*collector 
makes them available. The mutator "mutates" (processes) 
structures tin memory [Ref. 5: p. 492]. 

fei SLICE METHOD 

ub c:hessmmplest method to eliminate the halting of 
m ram execution wnile GC 1s occuring is to time-share or 
time-slice one processor between list processing operations 
and garbage collection tasks. The GC could be done at times 
that would not be inconvenient or noticeable to the user; 
Wemeas during typing (keyboard inputting) in a interactive 
meeeth Or at Other times that could be setup or scheduled by 
a system clock. GC could be conducted at regularly scheduled 
mund the duration of GC could be determined by the 
is Characteristics (i.e. number of users, time of the 
day, job priorities, etc.). 

If the GC is to be conducted on a regular basis, then the 
duration of each GC cycle could be programmed to be relatively 
IES Would Of Course depend on several criteria of which 
one of the more important is the size of memory. The larger 


the size of memory, the less the number of times necessary to 


be set aside for GC (i.e. the garbage collector would be 
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called less frequently). Alternatively, the system could be 
designed so as to reduce the length of each tamemsiice: Mb 
no matter how often GC is conducted or how long a time slice 
is, it is important to remember that GC could be temporaria) 
suspended at any time and later resumed [Ref. 5: p. 492]. 
This suspension of garbage collection would be controlled 
the system operators and not the system users. Of courses 
this suspension of GC is dependent on the fact that suffrcrent 
memory always remains to allow program continuation. Although 
this would alleviate but not necessarily eliminate the aggra- 
vating problem of users waiting for GC, it would introduce 
still another problem, that of increased overhead in context 
switching, and still may not result in any net gain of Sym 
tem speedup [Ref. T0: p. 490]. 

There currently exist algorithms for both time sharam 
the operations of both the collector and mutator procek EEE 
and the two processor scheme mentioned in the previous em 
These algorithms demand a greater percentage of the.process: 
ing time than does regular garbage collection. Additionally, 
the results from these algorithms have shown that the garbage 
collector with one processor being time-shared wastes (i.e. 
does not use) more processing time than if there were two 
processors dedicated to separate operations and func tkam 
The main reason for this is that the time allotted for the 
garbage collection in a time-sharing environment must Pm 


ceed even if there was no demand for it. A system ts 
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time-shares one processor between list-processing tasks and 
bare colMiSe tion. tasks ¡has specific times set aside for 
both these tasks. These times are created by the system 
Nunc dessen time. Bor example, if a one processor 
Den Mee designed to conduct garbage collection every 100 
muENSSecondscand there was no necessity for GC during one or 
more of these 100 millisecond allotted GC times, then there 
is an apparent waste of processing time; because the processor 
Could be doing productive work (list-processing) rather than 
Mmeye waiting for thesallotted garbage collection time to ex- 
feeem>O it Could conduct list-processing [Ref."2: p. 355]. 
Petite mwdeolon deero10n Ler a particular computing system 
Mama de sto Maye Only one processor to SHARE the list process- 
Meche garbage collection duties, then the dynamic 


meocessor allocation method might be more appropriate. 


IE DYNAMIC PROCESSOR ALLOCATION METHOD 

sc ennaque inwelyes dynamically allocating one processor 
ac Garbage collection tasks and to list processing opera- 
t1ons as real-time needs dictate [Ref. 10: p. 496]. However, 
e trend. ot decreasing hardware costs including processor 
pp Dx n unuEsodtoweurld be more practical to devote one 
Die@eecoemmotniectlyeto GC requirements and devote another 
pau c ciSt-processrHg operations, even if there were 
mmes thar enere would be no need for GC (resulting in one 


processor being idle part of the time). But having ONE 
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processor, being used either in a time-shared or dynamically 
allocated method, would reduce the overall system cost ama 
eliminate synchronization problems between the two processors. 
In addition to the previously mentioned "time-slice MEA 
the other method of using only one processor to conductor" 
bage collection would be to dynamically share this proce 
between GC tasks and list-processing tasks. Steele has stated 
that dynamic processor allocation could be utilized to achieve 
agtaingpert ormamcge seve iker m Ot 

Dynamic processor allocation is similar to the time Sm 


scheme in that there is only one processor being utilize EM 


both GC and list-processing tasks; however there is no con- 
ception of time with the dynamic processor allocation method 


(i.e. the time at which the garbage collector takes control 
of the processor operations is NOT pre-established or pE 
designed). Rather, the garbage collector is only invoked 
WHEN absolutely necessary. The big implementation question 
with this method is this "WHEN." 

The dynamically allocated processor method is dit -IHE 
from the regular GC methods found in most LISP processo 
in that the algorithm that decides this "WHEN" is more complex 
and is developed with special considerations in mind: how much 
memory remains available to the users, job priorities, Simak 
sizes, etc. One implementation method for the dynamic musa 
could be that the garbage collector would be invoked at times 


that the users would not find it harmful to their programas 
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execution. In an interactive system, these times (that GC 
Gallia ceci may besbased on when users are utilizing a 

i ncadcosdb stile or when users first check into the system. 
M ponn is that there is no set or established time for the 
garbage collector to take control of the system in the dynam- 


Poly allocated processor method. 


E. VIRTUAL MEMORY (VM) METHOD 

immo Geescicme, first proposed by Baker, that operates 
co llects garbage Iin a virtual memory system. In this 
Metered, secondary storage is utilized through paging. The 
basic concept is that available memory is divided into two 
mlceesMedesemrspaces. These two areas are allowed to grow 
Mem opposite ends. The sizes of the two semispaces vary at 
execution time and the moving of accessible cells is done 
whenever a new cell is requested [Ref. 2: p. 365]. One area 
(resembling a stack that uses contiguous locations for the 
users' programs) is reserved for the list processor. The 
other area is available for providing new cells (also from 
contiguous locations). These two areas respectively resemble 
the previously mentioned working list and the free list. 

AMA) have shown that the use of a selective 
pL iM S LISP's function that requests a cell from the 
EC cumprove the efficiency of subsequent list- 
o e aG operations [Ref. 2: p. 551]. In a VM sys- 


tem, each time that a cell is requested (i.e. a cons is 
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executed) a fixed number of cells are moved from one semi- 
space to the other. 

Compaction is necessary for garbage collection in this 
method in order to avoid thrashing. For example, it has been 
advocated to keep one free list per page. In a paging CAMAS 
ment, the extra memory that is needed is of less importance 
than the size of the working list. Since the moved cells are 
compacted, page faults are more Iikely to be minimized EN 
moving of cells during a cons execution corresponds toma 
cell tracing in regular garbage collection. This method dis- 
tributes some of the GC tasks during list-processing operations; 
this guarantees that actual garbage collection CANNOT last more 
than a fixed or tolerable time: i.e. the time necessary Mouse 
semispaces and readjust a fixed number of pointers declared in 
the-user's programi |Ref I: P RER. 

For the purpose of this paper, the virtual memory MAMAS 
NOT reviewed in depth nor is it considered a worthwhile task 
to pursue to determine if it is an adequate method form 
rent garbage collection. This decision is based on thc, 
that systems with very large memories currently abound and 


are becoming more commonplace as time passes on. 


E. CONCLUSTON 

The successful implementation of a list-processing system 
with parallel garbage collection provides a strong foothold for 
parallel list evaluation [Ref. 15: p. 8]. List process E 


list evaluation is exactly wat DE SPODHSUEMS CES 
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Nic SP” obs may spend 10%-30% of their time 
mm carbacewcoltecction tasks. Running GC in parallel could 
cut the total real time for a given task by close to this 
amount, without requiring the user to plan explicitly for 
NS. Since the cost of CPUs has Deen steadily drop- 
Mee It would be practical to devote one processor to GC 
aula be Tale part @f the time [Ref. 10: p. 496]. 

Pariowmeayiamle processor allocation and time-sharing 
processor allocation have some advantages and disadvantages, 
the Two Processor implementation method is the one that the 
remainder of this paper is devoted to. The complexity of 
the algorithm for the dynamic processor method and the likely 
peeetbility that the garbage collector may at times not be 
Performing any useful operations in the time-sharing method 
peeetude Cither of these two methods from providing the 
mama solution for a parallel garbage collection scheme. 
Aito nal, the VM method is not pursued because of the ready 
availability of computing systems with large memories. 

Niche current trend Of decreasing hardware costs, it 
A c2! 3nd cost-effective to utilize the Two-Processor 
Mine” processor devoted to list-processing tasks and the 
Other processor strictly devoted to garbage collection tasks. 
This decision takes into account the advantages and disadvantages 
offered by the other methods. Both the two processor method and 
the time-shared method require at most twice as much processing 


Dove a a aee collection, but, since the list 


a 


processor no longer has to perform garbage collection ens 
is a net speedup in the list-processing operations [Ref. 5: 
p. 492]. As the cost of processors is steadily decreed P Em 
the feasibility of conducting GC with two processors should 
increase with time. As the cost of hardware continues to 
fall, one more processor in a system would not be infeasible. 
In fact, having this processor devoted entirely to G@ Wome 
in the long haul provide more advantages and would alleviate 
user frustration; but more importantly, it would eliminas 
the halting of execution of user programs while GC is dam 
place because of lack of available memory. 

The specific operation of the mutator processor ¡nm 
succeeding chapters will NOT be discussed in detail because 
basically it is just a typical, currently existing type Ae 
and studying its operation would reveal no significant or 
new techniques for the operation of the collectar. 

Instead the collector processor is examined because it has 
unique and different characteristics and is devoted enm M 
to the task of garbage collection. Synchronization between 
the two processors will MELIUS ad. 

Because of the numerous algorithms and implementations 
that currently exist for concurrent garbage collection jae 
one hardware and one software method of the Two Processor 
method will be examined. However, the selected hardware and 
software methods are both considered to be the ones that are 


used in comparison with other garbage collection methods. 
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These two methods for two processor allocation with concur- 


e pase scollection are examined in the next chapter. 


EI 


V. "TWO PROCESSOR" GARBAGE COLLECTION IMPLEMENTADO 


A garbage collection cycle is defined as the total Jean 
cution time required for both the marking and sweeping NE 
claiming or scanning) phases. For the concurrent GC SySMENM 
there exists a requirement (primarily system efficiency) to 
reduce the length of the garbage collection cycle [Ref. 16: 

DII Ds 

Implementing concurrent GC in a LISP environment wist mE 
Two Processor Method raises an important question: Is it 
feasible to utilize hardware or software techniques or possibly 
a hardware and software combination to achieve concurrent 
garbage collection? This question will be examined in this 
chapter. 

There are numerous methods and algorithms proposed for 
implementing a parallel garbage collection with two processors. 
Some methods are: Ben-Ari's two-color scheme [Ref. 17], 
Dijkstra's three-color schme [Ref. 18], Kung and Sone" Sagi 
color scheme [Ref. 3], and Steele's special coded or micro- 
coded scheme [Ref. 10]. However, this paper will only 
concentrate on reviewing two classic approaches to parade 
garbage collection: 


(1) Dijkstra's Software Implementation Method 
(2) and Steele's Hardware Method. 


These two methods were chosen for review not only because 


they represent some of the earliest ideas on the problemen 


56 





COIE tion but also because each proposed a different 


eype Of Solution. 


A. DIJKSTRA'S METHOD (SOFTWARE APPROACH) 

Das algorithm for parallel garbage collection re- 
Mee odas tinct and separate processors, called the mutator 
Ane collector. Both processors operate from and on a 
common memory from which cells required for a user's program 
are allocated and deallocated. The collector continually 
executes a two-stage cycle while maintaining the free list 
Cenan lable cells for the mutator [Ref. 3: p. 1144]. The 
Mor 1s dedicated to executing the-user's list-processing 
program and the collector is responsible for garbage collection 
tasks. 

With two processors operating on the same memory there are 
Eto De coordination and synchronization problems whenever 
the collector is marking the in-use cells on the working list 
Ae same time that the mutator is modifying cells on the 
Eme working list. One solution to this problem (and the one 
AnD kstra's approach proposes) is to utilize colors, which 
Nes tatustot each cell; 1n particular, a third color 
(gray) is used during the mark stage to ensure all in-use 
STD mao ed. 

Dijkstra's software-oriented method for concurrent GC with 
BDNEDIOSCSSODsshas sometimes been called "Dijkstra's three- 


ao ce iitec tion algorithm [Ref. 35: p. 1145]. In 


5 


other literature, it has been referred to as an "ong blc 
garbage collector. On-the-fly garbage collection 25050 ae 
tem that allows concurrent execution of both the mutuo 
and the collector WNNONe 17: p. 34M" 
The three colors for cell marking proposed by Dijkstra 

are: 

(1) "white": indicates unmarked or unused 

(2) "black": indicates marked or used 


(5) and "gray": indicates that the cell has been regum 
to be used by the user's program [Ref. 2: p. 554]. 


Garbage collection is performed by a two-stage cycle (i.e. 


a two-phase algorithm): the collector marks all cells a LLL 
ible from the root and then appends all unmarked (and hence 
inaccessible) cells to the free list. The basic algorithm 
continually executes mark and sweep phases that are Summi 
to the phases discussed in Chapter 5. The algorithm that is 
used applies a simple scan and mark method for mark propaga- 
tion. However, the marking phase is a little more compu 
than the mark phase in regular GC systems because two mark 
bits are now required instead of one mark bit. Two mark bits 
are used because a cell can be in any of three states, which 
correspond to the above three colors. | 

A description and illustration of the collector's iam 
phase and collecting/appending phase is provided later in 


this vea tr 
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Each cell in memory contains the following: 

G Adata field. 

a xt contaim pointers to other cells. 

MES uM nobit color field that is used by the collector 
ee le enther black, white, or gray [Ref. 3: p. 
11447] . 

Nos trates a typical cell. Notice that this 
NEN NPMerent cell configuration than that in Chapter 5, and 
Ot mecessarily tne only configuration of a typical LISP 
celk. 

All cells contain the two extra bits for marking the cell 
enek, White or gray. During the mark phase, the collector 
blackens all Gcimiceiniat amemsaeceessible to the mutator. During 


the scan phase, all white cells are returned to the free list 


and all black cells are whitened. 


Left Pointer Right Pointer | 





Note: Color field is 2-bits 


eeure 5.1 Three-Color Cell Format. 
Located within main memory, which is common to both 


processors, are two distinguished cells: 
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(1) One cell points to the beginning and end (TCM 
and tail) of a linked list of cells that are aval ME 
for users' programs. This linked list 1s summam 
the previously discuss cd EE 
(2) The second cell also points to a list: a working list 
of cells that are currently being used by the mutator. 
A cell is accessible if it is located within the work- 
ing list; and a cell is in-use if it is either acr PL 
ible or located on the free list [Ref. 3: p. 1144]. 
In a compacting GC system, these two lists are locais 
at opposite ends of memory. The mutator and the col le@iam 
tasks and responsibilities are subsequently discussed. 
l. Mutator Tasks/Responsibilities 
The mutator uses primitive operations that alova 
to change a pointer of any accessible cell to point tomm 
other accessible cell, including NIL. The mutator operations 
include pointing to a new cell that is removed from them 
of the free list after that cell was requested by a user's 
program [Ref. 5: p. 1144]. In essence, the mutator SEX 
sponsible for conducting the list-processing operationum 
quired by a user's program. The mutator is doing useful work 
while the collector is collecting and recycling garbage mM 
to pemmit thew reuse by “the muta tome 
The color gray is used during the mark stage TOMAMOS 
that all accessible cells are marked. The mutator helps 
marking phase of the collector by changing a "white" COMAS 
"gray" when that cell is requested and used by a user SEDE 


gram [Ref. 2: p. 3554]. Cells that are black are used DAME 


mutator to perform list-processing in the user's progama 
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DENN NEM M xs whmte (1,e. cells that are not being used 
EN dguestedEhrom the free list by the mutator are white). 
Cells, that are no longer being used by the mutator and have 
not been already reclaimed by the collector, will eventually 
Wewceturned to the free list. 

For synchronization purposes (between the mutator and 
the collector), there is a restriction placed on the mutator's 
mera tios. During the marking phase, no cell can ever be- 
Come “lighter” [Ref. 18: p. 969]. In other words, whenever 
Mator changes a pointer in the pointer field of a cell, 
meteecel] must be shaded. Shading is a primitive operation 
that te cell tO grav. It has no effect on black 
EN or other gray cells [Ref. 5: p. 1144]. 

The mutator will initiate an interruption and will 
Econ ly when the free list is reduced to one cell. It 
S list processing when the collector returns additional 
cells to the free list (i.e. the free list is now greater 
than one cell) [Ref. 2: p. 354]. It has been determined 
that the probability that the number of available cells on 
HE s wfl ever contain only one cell is very low 
[Ref. 17: p. 336]. This is because the algorithms for the 
umxoIxEEtNeccollector are designed to avoid this situation 
BuU cscoct0bhe two distinguished" cells that exist 
in memory. 

Acs is done at one end of the free list 


and appending of garbage cells occurs at the other end of 
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the free list. The algorithms fer they two processors sae 
designed to take into account the required synchronizat mom 
between the mutator and the collector so as to avoid having 
only one cell on the tree list. Butmii thelfree 1r rE 
ever reduced to ONE cell, then the mutator would have to 

wait until the collector returns more cells to the free 
Again, this situation will occur very infrequently. 

The mutator's execution can result in cells that NE 
neither on the working list (pointers are now removed) or 
are NOT yet on the free list. These cells are the "gamm pepumEi 
cells which will be reclaimed by the collector. 

2. Collector dS s ReSpoNmS NN 1 times: 

The basic function of the collector is to Idem 
"garbage" cells and collect them at the end of the free 
in memory by repeatedly executing the two-stage cycle. This 
cycle first marks (blackens) all cells that are in-use. Then 
it appends all the unmarked cells (white cells), dur" 
linear scan of memory, to the free list; next it unmarks or 
whitens the marked cells (black cells) in preparation for the 
beginning of the next GC cycle [Ref. 3: p. 1144]. FICUS 
and Figure 5.5 illustrate the above two-stage cycle of the 
collector.  Appending a garbage cell to the free list is the 
collector's ONLY modification of the existing topology EBENE 
the sWape of the"data structure) |Ref. 1: Tp 902S 

How are the collector and mutator synchronized 7am 


other words, how is it possible to ensure that the collector 


62 





No snoot chance a nongarbage cell (black) to white 
or does not change a garbage cell (white) to nongarbage 
MAC JAS Dre y Tous ly mentioned, the algorithms are designed 
to ensure this necessary synchronization. Additionally, the 
menor tne thira color (“gray”) also helps in providing syn- 
Chronization between the two processors. 

Hero are sone CONStrdints placed on the mutator. 
N:IENUMPFSPOr can only shade cells (i.e. white cells to gray 
cells) and it can only change pointers in the cells' fields 
ma ont ONLY"to other already accessible cells. 

At the end of the marking phase, there are no gray 
cells. The absence of white reachable cells prevents the 
mutator from introducing gray cells while the absence of 
gray cells prevents the collector from doing so [Ref. 18: p. 
weep. 6 lhe Markine and the appending phases of the collector 
mee reviewed next. 

a. Marking Phase of the Collector 

INM as tor colors: the used cells of 
Is pros ram black, the cells in the free list white, and 
AECE is gray. 

Mitially, before any cells are allocated to a 
user's program, all cells are white [Ref. 18: p. 969]. The 
COlteCCuCmeaceotplasies the marking task by initially "graying" 
the first used cell on both the working list and the first 
cell on the free list (i.e. the roots of both the free list 


and the working list). The collector then selects a gray 


cell and tracing proceeds by the collector graying any other 
cells that are linked to this gray"eell (this selection 
the initial gray cell is dependent on the particular algorithm 
implementation, but the simplest method to implement is to 
have the collector utilize the first cell it encounters dum 
ing its sequential scan of memory). Those cells that ane 
linked to this gray cell, including the initial gray Cola 
are then "blackened." All cells that are still white after 
the marking phase is completed will be "garbage" [Ref. 18: 
Dp Figure 5.2 provides an illustration of the abs 
operations. 

Tracing through memory is complete when all the 
white cells are incorporated into the free list and alr IR 
cells are whitened in preparation for the sá mark phase. 
During the next cycle of the collector, these white GOMA 
are swept to the tail of the free list [Ref. 3: p. 1144]. 

Every garbage cell is eventually appended 
the free list. In other words, no garbage cell will ever 
remain unreclaimed or uncollected for more than two Game 
secutive garbage collection cycles. With a GC cycle con- 
Sisting of a marking phase followed by a sweeping/appending 
phase, it is impossible to guarantee that each sweeping phase 
will collect and append all garbage cells that existed dt 
its start. This is because new garbage could have been 
created between the current sweeping phase and the preceding 


marking phase [Ref. 18: p. 968]. 
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Figure 5.2 displays memory with respect to time 
I xcscoleetor performs the marking phase. The only cell 
EIN lu Hrstedun Figure 5.2 (and Figure 5.3) are the 
Eu bupster fields. Cells are either initially 
Mc List Gr the working list or are "garbage" that 
Ec .Pbsen appended to the free list. Cells 1 and 2 
are marked garbage cells ("marked'" garbage is explained 
iS im this Chapter). Cells 5-5 constitute the initial 
ae list, cells 6-9 constitute the initial free list, 
and cells 10 and 11 are unmarked garbage cells not yet ap- 
pended to the free list ("unmarked" garbage is explained 
Ex un this chapter). Cell 5 is the starting cell (initial 
Geme tor the mutator and cell 6 is the starting cell of 
the free list. 

Thevvarious subphases of the collector's marking 
Phase (as illustrated in Figure 5.2) are: 

(1) Subphase A is the beginning of the mark stage (see 
Subphases G and H in Figure 5.3 for the reason why 
eu] is initially guay). 

e Subphase B grays the roots (cells 5 and 6). 

(3) Subphase C indicates the halfway point through the 
mark stage (grays the sons of the root and blackens 
all grays). 

(4) Subphase D darkens cells (blackens gray cells and grays 
white cells) through the use of primitives. This sub- 
phase operation is performed to ensure that all used 
cells and all requested cells are marked in preparation 
tor the collector's sweeping phase. 

(5) Subphase E is the end of the mark stage. Cells 10 and 


11 are "unmarked" garbage cells which will be collected 
during the scan phase of the current GC cycle. 


65 





a a ia A 


— 0 5 o a M 


A E. YE A e 


Legend: 
W-white 
EQ 
B-black 

Ce es 

S1 NES 


Ml 


Ca 


W 


W 


W 


mita 
meter 


= = A ee 


pomme 


Quom 
me 





oL 
one 





ike Mie are OL 
tiros Caan Eh 


V» cm m m A Y ae ee ee ee eS ‘Mie {X 


66 


6 


D 


D 


W 





ee ee 


5.2 Memory Srayshot Wu rme in eee: 





eee 


B 


— =— — o 


Ie ra tions ot the collector and the mutator 


Preserve Dijkstra's invariance property: 


"Every white in-use cell can be reached from a gray cell 
along a path passing through white cells exclusively." 


meen eee present implies that all in-use cells are 
marked black; and when this happens, it signifies the end 
of the mark phase [Ref. 3: p. 1144]. 
b. Collecting/Sweeping Phase of the Collector 

Mii TS Din kstra's scanning phase. It is also 
the collecting, Sweeping, and the appending phase. This 
puce returns garbage cells to the free list. Although it 
was not originally designed to include the compaction feature, 
hime vd so be inplementéed to relocate cells for compaction, 
med all pointers being updated as necessary. At the end 
utbs phase, all unmarked cells are eventually returned 
AUT” Mies Mist and Compaction of cells can occur as they 
enned to the free list. 

When ene marking phase Ts. completed, there exist 
two different types of garbage cells: ‘ 

(1) Unmarked garbage cells which are collected during the 
son pasase Of the current GC cycle. These cells are 
called "quick" garbage. 

(2) and Marked garbage cells which are not collected until 
dee an phase of ehe next collection cycle. These 
CARa e called "slow" or "tloating” garbage [Ref. 3: 
p. 1144]. 

Differentiation is made between the two types of 


garbage cells to illustrate that there may be occasions that 


some marked cells become garbage while the garbage collector 
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is still in its marking phase: these garbage cells willl 


remain uncollected until the next garbage collection ereis 


Memory is examined in a linear or sequential 


fashion. White cells are now appended to the free list and 


DICK 


phase. 


phase 


where 


cells are whitened in preparation for the next mark 


Figure 5.3 illustrates the collector's scan 
in memory with respect to time. It shows memory from 


the last mark phase ended (see Figure 5.2) to the be- 


ginning of the next mark phase. 


The various subphases of the collector's cole 


ing/appending phase (as illustrated in Figure 5.5) are: 


CH) 


(2) 


(5) 


Subphase F is the end of the mark phase and the begin- 
ning ofc cc Sc 


Subphase G is the halfway point through the scan phase. 
The right pointer from cell 5 to cell 4 was dromum 
and cell 3's left pointer adjoined while cell 4 was 
still black (hence, the mandatory shading operation 
had no effect on cell 4). If this pointer mam pue 
tion occurred after cell 4 was scanned and whitened, 
cell 4 would now be gray rather than white [Ref. 3: 

p. 1145]. This situation occurs because of some 
primitive operations in the user's program that jae 
taking place at the same time as garbage collection is 
ECU 


Subphase H shows memory immediately after the scan 
phase. Cell 3 is gray because the right pointer mam 
cell 4 to cell 3 has been dropped and cell 4's left 
pointer has been adjoined; this is the same reason 

why cell 1 in Figure 5.2 (Subphase A) 15 initia 
gray. Cells 7 and 8 are now being used by the mutator 
(i.e. the user's program). This subphase is also the 
beginning of the NEXT mark phase if compaction SN 
a feature of the garbage collection process. Wikii 
cells 4 and 5 ("marked" garbage cells) will be appended 
to the free list during theme Ts cic. 
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(4) Subphase I illustrates memory if gompaction were a 
feature of the svstem. Caisse 6m 7, “Ut. 1, and 2 
represent the Smee 115t wien cell 6 Per: the head 
of the free list and cell 2 being the tail of the free 


list. White cells 4 and 5 ("marked” garbage cells) 
will be appended to the tree list curing the next G&G 
cycle. Cells 7 and 8, are on the working J7st. seems 


3 is in the same situation as ceil 1 im Figure SN 
(Subphase K). 


3. Conclusion of Dijkstra's Approach 


Dijkstra's method uses software techniques: two bit 


Un 


per cell are used for coloring or m&rking of the cells 3M 
memorv. No use of semaphores or other interprocessor com- 


^ 


munication is needed with Dijkstra’ 
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The mutator and the collector are separate processors, each 
With specific tasks and responsibilities. The synchronization 


and coordination features between these two processors are 


t 


incorporated in ihe aes the 


r4 


ign o espective algorithmes 


* 


hat ot regular garbdage collection. 


ct 
n 


which are more complex than 
The algorithms' descriptions in this paper have been grecia 
simplified. The purposes of discussing Dijkstra's method 
are to provide a view into one classical method of conducting 
Concurrent garbage collection and to show that concurrent 


garbage collection is possibie bw using software techniques. 


ry 


FOr additional information, see [Ref. 1$]. 

The next section examines a different approach: that 
of using hardware techniques for the impiementation of con- 
current garbage collection with two processors. It is not 


the objective of this paper to compare the two method 


ui 
QO 
Ett 


using two processors to accomplish concurrent garbage 


r 


Ateo one teado it is the intention to determine the 
Beeb ilityeot Using software or hardware methods to accomplish 


writs goal. 


B. STEELE'S METHOD (HARDWARE APPROACH) 

Steele's algorithm for parallel garbage collection, like 
IN omopproOacn, uses two processors, called the list 
E or and the garbage collection processor. While 
BEuStra's method does not explicitly provide for compaction, 
MM UELUT could be implemented, Steele's method does ex- 
pieitiy allow for compaction. Compaction is done using 
erewtwo-pounter technique previously discussed [Ref. 2: p. 
354]. 

PschEproceseeor has its own stack, which are used for 
Menperary storage and recursion. The list processor's stack 
W used for: 

MD List structure manipulation. 


(2) and Temporary variables, such as local variables, for 
LiSP Sanctions. 


The garbage collection processor's stack is used for: 
Smarking of the accessible list structure. 
aime recursive tracing [Ref. 10: p. 497]. 

One ot the goals with Steele's approach was to keep the 
MOSES sor. s overiead to a minimum by using necessary 
synchronization features between the two processors [Ref. 19: 
p. 50]. Semaphores are used for this interprocessor communica- 


Crone aneesymeh HOM ateion . 


dl 


Both processors operate from and on a common memory from 
which cells required for users' programs are allocated and 
deallocated. The cells that are located in memory are similar 
to the cell's configuration previously discussed: the Tecc ME 
fields may contain pointers to other cells. 

Steele introduced a new concept in the internal organiza- 
tion and representation within memory. Cells are organmo 
into sets called spaces. A space is nothing more than 
ordered sequence of cells [Ref. 10: p. 497]. The cells are 
of the same size and of the same format; in other words, 
thes ccu NC homogenous and therefore the spaces are also 
homogeneous. There is noeconsideration of cells of amu 
sizes. 

Steele's garbage collector makes exclusive use of sema- 
phores and requires two bits per cell: the mark bit amemus 
flag bit. These two bits are used NOT only for the regum 
marking of cells but also for compaction and readjustment 
of pointers in the celis' fields Ref. 2 m R 

A "space" has two pointers associated with it. These 
pointers are called freelist and lastfree, which respective- 
ly point to the first and last cells in a linked list moii 
available cells within that particular space. These two 
pointers resemble Dijkstra's two distinguished cells located 
in memory; except thate there may be many freelist and last- 
free pointers depending on the number of spaces; while in 


Dijkstra's approach, there are only two such distinguished 


^d 


cells. These two pointers are actually structures that each 
have two components: 

(Ly A space Indicator. 

(2) and an integer which is a valid index within that 
space's sequence of cells. In other words, this 
integer identifies a particular cell within a space 
[Ref. 10: p. 497]. 

Eust theospaces is predetermined by the system 
designers: it is a system design decision. The concept of 
these combined spaces resembles the notion of the free list 
mathe working list previously discussed. 

Hnc uceHuronizataon between the two processors andthe 
Mies of the two processors is subsequently discussed. 

a Ooni zation of the Two Processors 

As it was with Dijkstra's software approach, the 
Ex chronaization cof the two processors is necessary because 
A C processorn may be relocating cells within a space 
(which is required for garbage collection and compaction) at 
the same time that the list processor is trying to operate on 
these same cells. The way that Steele's approach handles 
Mo roblem is through the use of the flag bit and semaphores. 
This flag bit is the key principle that allows the garbage 
A rocessor to relocate cells with the list processor 
und e oE rto lReft. 10: p. 498]. . Before the list processor 
pomum O ration on a cell, it first must check the flag 
bit of that cell. If that bit is set to true then it means 


that the cell has been relocated and that the new address 


uncta uEnestrrst component field of that cell 


706 


[Ref. 10: p. 498]. Steele calls this process "nommal1 zamena. 
(to assist in the processor synchronization, Steele mas 
porated the normalization process into a single function). 
The garbage collector processor sets the flag bit atten 
Locating ther ell. 

Additionally, the "P" and "V" semaphore primitus 
defined earlier by Dijkstra are used for synchronization 
[Ref. 10: p. 498]. Steele carried the idea of the above 
semaphores to a higher degree in that more complex synchron- 
ization operators were defined (in terms of the "P' and "V' 
primitives). These semaphore primitives permit exclusive 
access to a single cell for only one processor at a time 
in doing so the entire space where that cell is locam c MD 
NOT locked out to the other processor. 

semaphores are used to interlock access to themes 
lists. For efficiency and to provide a greater degree of 
synchronization, it is necessary to have a freelist inter- 
locked as little as necessary. Synchronization is accomp em 
by having a pointer. to the last cell of the free list amm 
stipulating that the garbage collector processor can one 
append cells to the end of the tree tise anid een ema 
processor can only remove cells from the beginning Gi aime 
free list. With this method a free list itself is inter- 
locked with a semaphore ONLY when the free list is reduced to 
one cell. This situation will occur very seldom in pregame 


¡REIS o OT, 
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Eacumpmeecessor adlso has a global register which con- 
tains the address of the particular cell that a processor 
Wants exclusive access to. If the first processor tries to 
use that cell before the second processor has released it, 
then the second processor will loop (busy wait) until it 
Gan have access to it [Ref. 10: p. 498]. Furthermore, it is 
necessary to keep the garbage collector from shifting from one 
Mase to another phase while the list processor is trying to 
determine what phase it is in. To accomplish this goal, 
Steele uses two global "variables", gcstatesem and gcstate 
eee former 15 a Semaphore controlling access to the latter, 
which in Buen has as its value: mark, relocate, update, or 
usu utsestatesem" were executed by the list processor, 
meee garbage collector is prevented from changing phases 
Meer. 10: p. 499]. 

Ass cells, pernters, and semaphores can be 
operated on by either processor. However for synchronization 
purposes, some constraints and restrictions must be placed 
oe processor operations. Neither" processor can push or 
pop cells from a stack at the same time that the other 
processor is doing so because this could result in erroneous 
stack indexes. The solution to this problem is to use another 
semaphore with each stack. Additionally, the list processor 
can access and modify ONLY accessible cells. This is accom- 
plished by having the list processor follow the following 


rule: 
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If the list processor modifies a marked cell during the 
garbage collector's mark phase, it must ensure that the 
garbage collector reexamines that cell and any other cells 
that may be affected by the ormeaial cell modirvedatren 
AA Oo 
Each processor has its own internal registers that came 
used as "temporaries." However, neither processor can ex- 
amine the other processor's internal registers [Ref. IMAN 


498]. 


2. The Garbage Collector m 

The two bits in each cell (the "mark" and the "flag" 
bits) determine the current status of a particular cell 
respect to the current phase of the garbage collector. MISAS 
two bits and the use of these bits by the two processgm mM NL 
vide one of the biggest reasons why concurrent garbage col- 
lection is possible. 

Table I provides a summary of the meanings of these 
two bits. For additional information, see [Ref. 10]. 

The garbage collector repeatedly executes in sco 
the following four phases that comprise garbage collection 
with compaction: 

(1) Mark Phase -- A simple recursive and trace method for 
locating all accessible cells. Through the use of 
one of the global variables, the garbage collector 
processor locks out the list processor during tiii: 


phase. 


(2) Relocate Phase -- Uses the two-poimter scheme PAN 
viously discussed (see Table I). 


76 


Mark bit 
Flag bit 


Mark 
phase 


Relocate 
phase 


Update 
phase 


Reclaim 
phase 


TACLE I 


false 
false 


Cell not yet 
seen by mark 
and trace 
routine. 


Discarded 
cell.May be 
used to re- 
locate an ac- 
cessible ob- 
ject into if 
necessary. 


Discarded 
cell. Ignored 
by update 
phase. 


Discarded 
cell. May be 
returned to 
freelist. 


false 
true 


(Does not oc- 
cur during 
mark phase) 


Relocated 
cell. First 
pointer com- 
ponent indi- 
cates new 
location- 


Relocated 
cell. Ignored 
by update 
phase. 


Relocated 
cell, now dis- 
carded. May 

be returned 

to freelist. 


2 


true 
false 


Cell seen by 
mark and 

trace routine 
Cell is there- 


fore access- 
ible. 


Accessible 
cell. May be 


relocated 
into new 


place 1f 
necessary. 


Accessible 
cell. Pointer 
components 
may need to 
be normalized. 


Accessible 
cell. Ignored 
by reclaim 
phase. 


MEANINGS OF THE MARK AND FLAG BITS 


true 
true 


Cell on 
freelist. 
Should not 
be seen by 
mark and 
trace rou- 
tine. 


Cell on 
freelist. 
Ignored by 
relocate 
phase. 


Cell on 
freelist. 
Ignored by 
update 
phase. 


Cell on 
freelist. 
Ignored 

by reclaim 
phase. 


(3) Update Phase -- Sweeps over each space and normal 
all pointers in all accessible cells and also on@ar 
list processor's stack. Combined with the melocame 
phase, this constitutes the required compaction. 


(4) Reclaim Phase -- Sweeps over all spaces and looks for 
cells with their mark bit set to false (see Table I). 


a The List Proe <0 
The list processor accomplishes its task of being 
able to continue list-processing at the same time thats 
bage collection is taking place through the creation aud 
of list manipulation primitives. Some of the more comm ms 
these primitive -E 


(1) Creation of new cells from the list available cells. 
In LISP, this is accomplished by eons. 


(2) Selection of cells’ components. In LISP, this MED 
complishesmbacorcand ‘edie 


(3) Determination which space a cell belongs to. In LISP, 
this is atome 


(4) Comparison of pointers,where pointers identify a particu- 
lar cell within a space. In LISP, this is eq IRo REENE 
DU P 

CE CONCLUSION 

Steele's approach provides a method to achieve concurrent 
garbage collection using specialized hardware (i.e. micro- 
coded processors). His use of semaphores to assist in syn- 
chronization is in itself not a new concept. But his MSM 
the flag bit that basically serves as an indirection marker 
that permits the garbage collector processor to relocate NE 
within the same space as the list processor is operate 


a new idea. 
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Do approaches to achieve paraliei or con- 
un ---——---co0llecet(yon have been briefly discussed. 
PagikctGa semethod proposes software techniques while Steele's 
approach proposes hardware techniques. Which method is bet- 
tr depends on many factors, such as the size of the system, 
Sspectally main memory and secondary store; the speed of 
the processor(s); the number of users on the system (average 
and worst-case); which high level language is being used 
A ate ish}; the cost of the system, etc. 

eer) illustrates the major characteristics of the 
SaS. Steere's algorithm, which is more complicated 
nan Daykstra's algorithm (mainly because of its compacting 
NNNM) requires all active cells to be bound in a 
Stack. Dijkstra's algorithm uses multicolored marking (white, 
wand Diack). The gray nodes inform the garbage collector 
Ds a cation occurrences. The garbage collector 
counts not-gray nodes during the mark propagating phase. The 
ee nee of gray nodes, when the garbage collector examines 
Nu cunto undicates that the list processor has performed 
Some Inst modifications. Dijkstra's algorithm is obliged to 
apply a simple scan and mark method for mark propagation (be- 
E 5t correetly count not-gray nodes) [Ref. 15: p. 
EE CxESddrtjonal information, see [Ref. 2]. 

I —xceonmmendegd that a technique using hardware tech- 
nology be utilized to implement concurrent garbage collection 
because hardware costs are continuing to decrease, while 


sottwane costs are continually increasing. 
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TABLE II PARALLEL AND REAL-TIMESGARBAGE COLLECI ie 


ALGORITHM STORAGE REQUIRED 
Dijkstra No stack; 2 DiC MB 
Steele Stack; Z Dits Cola 


several semaphores. 


COMMENTS 


Uses a free list; 
Main objectivo 
to prove corredi: 
ness. 


Designed to be 
microcoded; does 
compaction. 


It has been shown with the two previously reviewed 


garbage collection methods of Dijkstra and Steele that con- 


current GC is feasible, including having the feature of com- 


paction. There are numerous other methods and techniques 


implement concurrent GC. The next chapter briefly dg T 


at some of these other methods and the future possibilities 


of concurrent garbage collection. 
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ERA LGORITAMS/ FUTURE POSSIBILITIES 


le. INTRODUCTION 

Powe FOr concurrent garbage collection are too 
numerous to list, let Bue det ribe. Instead, this chap- 
Ex brietly introduce several other concurrent garbage 
collection methods and some future possibilities in the 
m cotosurbace collection. These methods are not necessarily 
restricted to two processors nor do they have compaction as 
EN uure- Additionally, the configurations of the LISP cells 
for. these algorithms differ from those presented in this 


paper. 


B. OTHER ALGORITHMS 

NIE a S that proposed by Lamport. Al- 
though efficiency of garbage collection systems has been 
gcally ignored in this bn it remains an important 
Mou Neration in the design of any computing system. 
Lamport informally proposed that possibly the best way to 
improve the efficiency of a garbage collection system is to 
ELM uns processors for garbage collection.  Lamport's 
algorithm allows any number of processors to be used in a 
GC system, both as mutators and as collectors [Ref. 3: p. 
LESS 

proposal ot using multaple list processors for 


concurrent garbage collection solved two problems: 
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(1) Concurrent execution of multiple mutator processcóN 

(2) Increasing the speed of the garbage collector. 
These two problems were solved through the use of constramums 
placed on the processors and through parallelizing thie ccm 
bage collection phases.  Lamport's goal was to keep the 
overhead of the mutator to a minimum; consequently, no un- 
necessary synchronization between the collector and the 
mutators was introduced [REFIERO 

Lamport's algorithm guarantees to mark any structure. 

It requires a two-bit field in each node to allow marking. 

In a multiple garbage collector system, Lamport's algorijgg@ 
effectively requires each gerbag® collector to acceso ma 
physically separate portion of the node space. Each garbage 
collector examines nodes until ONE collector finds amd 
an accessible (shaded) node and then it shades its successors. 
All garbage collectors are then reset, restarting the dS E 
of their assigned node space. Consequently, with most 
Structures, only one garbage collector does any useful work 
between "resets" [Ref. 20: p. 367]. 

A substantially improved version of Lamport's algorithm 
is obtained by allowing each garbage collector to complete 
its sequential pass through its assigned section of the node 
Space before resetting, instead of resetting as soon as one 
garbage collector has found and "colored" a node. "DNE 


version has two advantages: 
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Pee Semeral garbace, collectors may find shaded nodes on 
each pass, be able to mark them, and shade their 
SUGEGESSOrs . 

(2) Successors to a marked node may themselves be marked 
in the same pass through the node space [Ref. 20: 

p. 90968. 
Meme and Song's algorithm is introduced next. 

Kung and Song's algorithm, which is a variant of Dijkstra's 
three-color collector, uses FOUR colors (white, off-white, 
gray, and black) and has two independent processors that 
share a common memory; one processor for list-processing and 
mee Otnmer for GC. This algorithm requires that the nodes of 
Mae tree list. always be off-white; this requirement shortens 
the marking stage because the nodes on the free list do not 
have to be marked (i.e. the free list does not need to be 
traced) [Ref. 3: p. 1145]. Each cell contains three fields: 
a left and right pointer field and a color field. 

¡me and "Song's algorithm also introduces a deque, which 
permits application of a recursive trace and mark method 
(this is different from Dijkstra's simple scan and mark 
method). The insert operations to the deque increases the 
overhead of the list processor, creating a new problem of 
Meo allocate the deque [Ref. I5: p. 1]. Although usage 
of a deque for the marking phase increases the complexity 
of proving the correctness of the system, a deque is used 
ISO awvord-poessibye access conflicts, since both 
processors may manipulate the deque at the same time [Ref. 


Io g IT 
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As previously stated, efficiency has not been speca ar am 
ly addressed in this paper, and neither has system perfon meani 
Kung and Song's algorithm satisfies the following prop- 

erties, which are important to the*systen"s peTrTormqe 
(1) The time to perform the marking phase by the garbage 
collector is independent of the size of memory, amd 


depends only on the number of active nodes. 


(2) The nodes on the free list do not have to be marked 
during the garbage collector's marking phase. 


(3) Only two active bits for encoding four colors are re- 
quired for each node. 


(4) Minimum overheads for the list processor are introduced. 
(Dijkstra's method does not satisfy properties one and two 


above) [Ref. 21: pL 


Kung and Song's system is designed such that the marking 
phase has an execution time proportional to the number 
active nodes and independent of size of the memory. The re- 
sults of their parallel garbage collection system show 
a parallel garbage collector is usually significantly Milome 
efficient in terms of storage and time than a regular gar- 
bage collector (ea OA 

The next method of concurrent GC is that introduce 
Bonar and Tevi ank 

Bonar and Levitan's method uses specialized hardwamem 
Content Addressable Memory (CAM), that creates a very mE EN 
real-time LISP system (they define a real-time list-process- 
ing system as having the property that the time required 


each of the elementary operations is bounded by a comc um E 
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independent of the number of cells in use). Each word con- 
"xu one cap laitled” LISP celdowith only three fields: a 
left (car) and right (cdr)field, which can both point to 
another LISP cell; and a garbage one-bit field (that in- 
dicates if the cell is free). The key observation about GC 
cueca cell is that pointers to any given cell can be 
located with only two CAM operations: 

(1) A CAM search of the left fields of all cells in memory. 

(2) A CAM search of the right fields of all cells in memory. 
Wien a free cell is needed, a CAM search is executed (using 
constructed routines or algorithms), for a cell whose gar- 
bage bit is set. Their algorithm requires that all cells be 
initialized with the garbage bits set and the left and right 
e puedas set to nil. A part of the list structure be- 
Ames potential garbage when any one of may pointers to it 
ete ted: this cam occur by using the funetions replaca 
Eu acd, that respectively replace pointers in the left 
Manda the right (cdr) fields of LISP cells. CAMS are 
Meent examine all memory cells in parallel and are con- 
Sw ed n such a manner that each word (a "word" is an 
ME n regular sub structure of CAMS) can compare its con- 
ene ather than its address, with a value broadcast by 
BEI Iis comparison process is done by».all+.CAM words 
Tanes ly. The €PU then interrogates the CAN to dis- 
cover which words, if any, match the broadcast value [Ref. 


Pap. le)... 
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Some advantages of this implementation are: 
(1) It performs all elementary operations 1 read com 
(2) All cells are available for use. 
(3) Even though the cost of a CAM has been estimated tombe 
1.5-3.0 times the cost of an equivalent size RAMIS 
is well suited for sufficiently inexpensive imp enemas 


tion with VLSI Tecmo» 


(4) It retrieves the correct value for a name associa NEM 
requiring only two CAM operations. 


(5) Strings and other dynamic data types (in addrtaonmm 
the simplified LISP cells discussed above) can be ele- 
gantly and efficiently integrated into the basic Seige 
without partitioning memory. 

Some disadvantages of this system are: 


(1) Circulay lists @eammot beweasil aco lected. 


(2) Does not support a virtual memory environment [Ref. 12: 
p. OIF 


Hibino's method is the next and last additional memes 
of concurrent garbage collection introduced in this papam 
Hibino proposed a special processor for a parallel 
system, which consists of two independent processes sharing 
a common database: the list process and the garbage col eM 
whichoeooperatem ue e perform garbage collection. 
Hibino's algorithm is designed on the condition thatomm 
active cells are bound in a linear stack: this condition 
necessary for a practical list-processing system. Add mM 
Itb necu cns iik bit per cell and has a scan-request-flag 
(SREQ) for the system (the SREQ is used by the garbage emmi 
lector for deciding whether to go into the reclaiming DE 
or to again perform the mark propagating phase) |Ret AA 


IE. 
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INE corras tie garbage collector of list modifica- 
Son toccurrentes. thus requiring only one mark bit for each 
cell. It applies a recursive trace and track method for 
mema ke phase (Dijkstra's algorithm applies a simple 
Sean and mark method) [Ref. 15: p. 6]. 

Hibino's special processor, which performs concurrent 
garbage collection, was actually implemented using two 
TOSBAC-40L processors (whose architecture is similar to the 
INTERDATA 6/16). The processor cycle time is less than 200 
nanoseconds. The processor is built using standard TTL 
NE technology, The microinstructions for this processor 
are specifically tailored for garbage collection [Ref. 16: 
eee Lio]. 

Tomo smmletniod requires very little overhead due to 
pulelism [Ref. 15: p. 8]. 

ie exe seeliOne provides am insight into future possi- 


bilities in the area of garbage collection. 


fee OTURE POSSIBILITIES 

Tits far the only application of parallel garbage col- 
cton Nas involved one or two processors. With the advent 
of VLSI technology, having multiple processors on a single 
chip is not only possible by potentially cost-effective. 
Lamport first explored the possibility of concurrent garbage 


collection in a MOGs OC 6 Sision environment. There are numerous 
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other possible multiprocessor Configurations, which (CAMAS 
volve any number and any combination of collectors and list 
processors. Two simple examples are: 


(1) A system with only one garbage collection processam 
and having a finite MiMmberY Otero msp mec scons 


(2) A system with several garbage collection processors 
and at least one list processor. 


The former example could be implemented by using a 
ion of Steele's method. Steele's synchronization inter koea 
would require redesign but may not be as complex as in his 
original method (e.g. require one primitive to lock onm 
the list processors, except the one list processor thai 
manipulating the ftreegimnst) Rer a 

The latter example could be implemented by havin Sa 
the garbage collectors in the same phase at the same time 
and then divide the work in each phase among them; or je 
could be implemented by assigning specific "spaces" (as in 
Steele's method) semi-permanently to the processors am m NE 
lowing each processor to collect garbage asynenroneoueia) 
[Ref. 10: p. 506]. One implementation of this method AN 
ple garbage collectors) could be to have all the garbage 
collectors active simultaneously. The garbage collector 
would still have four basic phases (only three if compacti 


not desired): 


(1) Set-up 
All nodes are marked as garbage by setting a mark pit 
(or bits). This is similar to Dijkstra'"s colorem 


(white cells). This phase is typically executed E 
by-product of the collection phase: 
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(2) Marking 
Mee eec eSssSible from the roots of the list struc- 
ture are marked as accessible. This is similar to 
Dijkstra's coloring black. The marking phase may have 
several stages in which the mark stage (color) of the 
node changes. This will necessitate having more than 
one marked bit. 


6) Collection 
All unmarked (white) nodes are added to the free list. 


E Compaction. [Ref. 20: p. 567] 

While concurrent garbage collection, using either hard- 
ware or software techniques, is being pursued as a desired 
mere in list-processing systems, future possibilities 
in the variations of other garbage collection methods are 
also being studied. 

examples anfextension of the previously discussed 
puwcurrent GC method (with multiple processors) could be 
BE wwcoabloeatauon of several processors to list-processing 
m urbase collection (i.e: perform GC or list-processing 
as necessary). Clever heuristics would be needed to describe 
Whether switching a processor at time "t" is desirable to 
forestall having to wait on an empty free list at time "t + 
er. 19: p. 50]. 

meeer example is in the case of one processor being 
ame shared between list-processing and GC; a special micro- 
wepe essor could be designed to switch to garbage 
SerceteonmONLY on completion of a list-processing primitive, 
which would eliminate many of Steele's synchronization 


Inca Ref. 10: p. 506]. 
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In the last example, it is known that garbage collect 
with one processor typically consumes a substen tai 
centage of the total computation time used by 11St™=proceses 
ing systems. Shifting storage reclamation costs, away from 
a program's run time overhead, to compile time would Teams 
the system cost of operating list-processing systems. MAS 
can be accomplished using a Deutsch-Bobrow scheme (which 
uses a combination of garbage collection and references 
methods). This scheme maintains reference counts in a way 
that can be expected to require mikce UN usual. It 
has the property that the counts need to be updated iai 
less often than by traditional methods. Moreover their 
method is incremental: consequently, unlike regular garbage 
collection, it is not disruptive of real-time computatagm 
PRet nace p.. 51d 

Automatic storage reclamation, when viewed in the Deutsch- 
Bobrow model, can benefit from compile time optimization 
ROPA SISTE 

These last three examples were only provided to 11M EMEN 
that studies in the variations of garbage collection Mos 
for list-processing systems are still receiving attente 
and will continue to do so as the popularity of list-process- 


ing systems grows. 


D. CONMEUSuoN 
With the advent of VLSI technology, many limitat Tomke 


restricting the number of processors (collectors or 11m 
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IS Dec ams etof cost or feasibility, are eliminated. 
Orwcourse, More processors in a system means more coordina- 
tion and control among the processors is necessary. This 
results in an increase in system complexity and in the syn- 
chronization algorithms. If implemented in software, this 
Wall mean an increase in the cost of maintaining these soft- 
ware algorithms. 

This chapter has shown that there exists a wide range of 
algorithms and techniques for concurrent garbage collection. 
VLSI technology has pushed the continuation of future studies 
im this area, as has the popularity of list-processing 
NSUems. 

Hunc Wast chapter of this paper provides conclusions about 


miewtcasMellity of conducting concurrent garbage collection. 


JI 


VII. CONCLU HON 


Storage reclamation in most list-processing systems. US 
cluding LISP, is a necessity. Moste I P implemen atnapi 
utilize garbage collection to reclaim storage (memory cae 
cupied by unused cells) as the data structures of a program 
grow and shrink. LISP is a highly interactive list-process- 
ing system, that is useful in many applications and is grow- 
ing in popularity. The flexibility and expressibilicy mop 
LISP have made it the "work-horse” of the Al community AMO 
12: DAA 

The use of garbage collection for reclaiming unused 
storage should always be considered when implementing a list- 
processing system [Ref. 4:'p. 506]. Is it better to utilize 
regular GC or concurrent GC in a list-processing system 
This is a design decision that must be made before imple- 
mentation. Both garbage collection methods (regular and con- 
current) have advantages and disadvantages. 

Regular garbage collection has the problem of unpredict- 
able suspension of users' programs while garbage collection 
is being conducted. This occurs when the free list hads epee 
exhausted.: As previously illustrated (and depending on the 
size of memory), regular GC can cause serious list-processing 
delays. which can occur any time the program needs a new 


cell AREA Ps 
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Comcurrent earbage collection eliminates these list- 
processing delays, which are caused by insufficient memory 
being available. Concurrent GC also allows program execution 
to continue without interruption even during periods when 
n seweolleetron Ys occurring. The difficulties in parallel 
weeoaee Collection are caused by the fact that the list 
pum@e@essor is modifying the list data (i.e. creating new cells 
Aaa cell pointers) while the garbage collector deter- 
mines which cells are garbage (non-accessible) [Ref. 15: p. 
1l. Which implementation is better? In the comparison be- 
tween regular GC and concurrent GC, the productivity of a 
parallel GC system can be as much as 150% that of regular 
panbace collection [Ref. 3: p. 1151]. 

mme idea of performing concurrent garbage collection has 
been around for some time. Knuth credits this idea to M. 
Minsky. Though concurrent garbage collection is an appeal- 
we idea for real-time processing applications, no papers 
were published until about 1975. Both Steele and Dijkstra 
were among the first to investigate such a System [Ref. 21: 
Steele in 1975 presented the first concrete al- 
gorithm for the problem of garbage collection. which in- 
volved a high degree of interleaving of the processors' 
demono Ref. 15: p. 1]. 

Garbage collection can be implemented with compaction 
or without compaction. As previously noted, GC with com- 


paction has been shown to have significant time gains in 
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terms of performance in LISP programs. Garbage cori on 
without compaction is typically performed in a two phase 
process of first tracing and marking all active cells and 
then sweeping all unmarked cells back to the free list. 
Garbage collection with compaction prevents "thrashing." 
While compaction is not a necessity even in a systems that 
utilize single-size cells, compaction in list-processing a PE i 
tems is recommended. 
Garbage collection with compaction may be the only way to 
prevent intolerably slow processing of lists having ele- 
ments which are widely scattered in memory. In these 
cases, automatic GC could be triggered when the numbenmem 
transfers from slow memory, per unit of time, becomes 
larger than a specified maximum [Ref. 14: p. 260]. 

This paper has attempted to show that concurrent storage 
reclamation in the form of garbage collection, with compaction 
and without compaction, is feasible today using software or 
hardware techniques; and if implemented will provide Detten 
services to the system users (i.e. less program execution 
interruptions, increased speed-up of list-processing Mu 
than regular garbage collection can provides 

Many possible methods of concurrent garbage collection 
systems have been presented, and study of the behavior and 
design (including implementation) of concurrent garbage 
collection systems is an increasing field of investigate 
as list-processing systems becomre more commonplace. These 


studies are occurring because regular garbage collection is 


unsatisfactory in many situations. For example, regular 
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garbage collection delays may cause more serious problems 
An aet raton and discomfort among interactive users. For 
example, in list-processing programs designed to control 
pseWcalsdevrces, such as robotics applications, suspending 
Mecane Operations for the time garbage collection normal- 
ly requires may be intolerable: consider the plight of a 
future tennis playing robot forced to halt in mid-swing while 
M performs garbage collection [Ref. 5: p. 491]. 

While this is a trivial exampie, imagine the consequences 
rec uUTar GC list-processing system, which is designed to 
Mide time, critical decisions, such as the system whose 
moon aga purpose is centered around "space defense." Now 
mrs ine wit program execution had to hait for garbage collect- 
ion at a most crucial time (e.g. unknown air targets approach- 
mwESShrinsten, D.C. from the NORTH at a high rate of speed): 
awis would not only be an unsatisfactory and unreliable sys- 
nube the results could prove to have unpredictable, dis- 
asterous, and extremely sensitive consequences. 

il wecemeurrcent GC is being Used at its maximum Capacity, 
EMEN COUENECS twice as much processing power as regular GC. A 
system with two processors (one dedicated to list-processing 
ea hier dedicated to garbage collection) will provide 
an increase in the execution speed and possibly a decrease 
in the total amount of memory required as well [Ref. 5: p. 


500]. With the decreasing costs of processors, the drawback 
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of requiring twice as much processing power is offset by 
the advantage of avoiding garbage collection interruptions. 
Wadler developed an idea that compares regular garbage 
collection and concurrent garbage collection using a term 
called the power drain of the collector, which is the ame 
of time that is used by the collector divided by the amame 
of time required by the mutator in a single cycle. Sime 
list-processing systems typically spend 10-30% of the 17g 
in garbage collection, the power drain of regular GC is 
typically 11-45%. For concurrent GC systems, this means mimiis 
the full capacity of a parallel collector will not normalki 
be used [Ret 5 v SD 
The ratio of the power drain between parallel and regular 
garbage collection can reach infinity. For example, in 
the situation where NO cells are used or released (i.e. 
the collector is never invoked), the “power drain” Jou 
regular drain" of a regular GC system is zero.  Howeuemm 
in parallel garbage collection, the power drain is One 
because the collector is still being kept busy even though 
it does not return any cells. Wadler has shown tha teu 
two processors operate at maximum capacity, the power 
drain is 2. This means that parallel GC Tequires tia MEE 
much processing power as regular GC," [Ref. 2: p. SiTi 
The power drain ratio between concurrent GG and regum 
GC will have its minimum value whenever the concurrent gar- 


bage collector is being used to its maximum capacity [Ref. 


A NIE 


A. ASSUMPTIONS 
While this paper has shown the feasibility of concurrent 


garbage collection, it must be remembered that several 
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assumptions were made in order to reach this conclusion. 


Some of the major assumptions in this paper are: 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


Economics is not considered an issue: the cost of 
adding additional hardware (i.e. processors) is con- 
sidered negligible, as is the assumption of having 
large memories. Memory space is not at a premium: 
large memory systems are becoming more commonplace. 


Processing time in interactive systems is not taken 
Memeo a deration. In other words, the only require- 
ment emphasized in program processing is that program 
execution will not halt due to GC activities. The 
Wnaeth of time for list-processing is not considered. 


Performance issues and proof of correctness are not 
discussed. 


DuaceNsmessues (e.g. productivity, system cost ef- 
fectiveness, throughput, and system viability), im- 
portant considerations in any system, are also not 

r ees ancludes the problem that most gar- 

bage collection algorithms seem to have in common: 

they spend a lot of time "coloring" or marking garbage. 
ice ae tticient 1 the application contains 

ieee data s erúuctüres that are modified only occasionally 
Geer = ive p.. S40]. 


Memory allocation, while related to memory deallocation 
(i.e. storage reclamation), is only briefly mentioned. 


All cells are of the same size. Cells of various sizes, 
Mien DEesecierarditrencime problem and solution, are 
not discussed. 


Future hardware costs will continue to fall; thus mak- 


ing it advantageous to pursue concurrent garbage col- 
lection with at least two independent processors. 


iremenpenation Of the list professor is a typical exïľst- 
IO ROCESSOT: consequently, its operation was not 
studied. 


Garbage collection in virtual memory systems was only 
briefly discussed. Garbage collection in a large 

VM system might be bypassed altogether, although this 
would degrade system performance by scattering the 
active cells sparsely throughout memory; thereby re- 
suc frequent paging [|Ret. 3: p. 1153]. 


mm 


(10) A basic understanding of list processing Tang udk -E 
in particular LISP, 1s reom edito grasp them, 
cussion of the various storage reclamation methods. 


(11) Overhead for the list processor is ignored: the defi- 
nition of overhead (in parallel garbage collection) 
is whenever the mutator performs any list data mods 
fication, it must then perform some additional operas 
tions to inform the garbage collector [Ref. 15: p. 
2]. For example, consider how frequently list modi 
fication occurs in list processing systems. If this 
occurred extremely often, it would be impossible to 
introduce parallelism into garbage collection. ROS 
tunately in the case of LISP, this list data modas 
tion occurs explicitly only when cons and Tepes 
operations are PERRA m 





While it is not the purpose nor intention of this page 
to prove any of the above assumptions, it still must Demonen 
that many of the above assumptions are critical to the des 
S YE implementation, and maintenance of a list-proces EB 
System that uses garbage collection (in particular concurrent 
garbage collection). The above assumptions should first be 
considered separately and then jointly when deciding on a 
particular scheme to take care of "garbage cells" in a list- 


processing Svs ten 


B. FINAL REMARKS 

The simplest example of a parallel garbage collect Tommie 
tem consists of two independent processes (the list processor 
and the garbage collector) that share a common memory. MOMIA 
processors must "cooperate" with each other to allow garbage 


collection. 
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Successful implementation of list-processing systems, 
with parallel garbage collection, gives a foothold for 
parallel list evaluation [Ref. 15: p. 8]. 

eO RME opara l lel CCS which involves a high degree 
Of interleaving of the processors’ actions, is one of the 
most challenging problems in parallel programming [Ref. 16: 
DENIS ine latest trend in the computing industry shows 
that software costs are increasing annually. This is due 
largely to increased maintenance costs of this software and 
eitemadssociated personnel costs related to developing and 
maintaining this software. On the other hand, hardware costs 
Meme been declining, and with the advent of VLSI technology 
have made it possible to have numerous processors on a single 
emp at a reasonable cost. Consequently, concurrent garbage 
collection using software is not considered the optimum solu- 
tion. Addition of a second processor for garbage collection 
mires very little overhead [Ref. 19: p. 50]. 

Ap Tices of processors continue to tall and as micro- 
coded processors become more — the design of these 
special processors becomes more feasible. If this is done, 
meene list processor can operate unimpeded; since processor 
interlocking is necessary, 1t could be achieved through 
special hardware in such a way that the list processor al- 
most never has to wait on the garbage collector and then only 


CO brief periods of time [Ref. 10: p. 506]. 


I 


Synchronization, which must be done mnmen tecno 
attempts to remove a node from an empty free list, can be 
accomplished if cell removal and cell appending operations 
are conducted at opposite ends of the free list. An empty 
free list should happen infrequently and any convenient 
synchronization primitive can be used to prevent this [Ref. 
LD SB 

It is the conclusion of this paper that concurrente 
bage collection is not only possible, but that it hase 
ready been implemented. The simplest (and least costly) 
method of parallel garbage collection with list-processing 
is to use one processor strictly for list-processing opem 
tions and another processor strictly for garbage collect ksi 
tasks. Parallel garbage collection with more than two in- 
dependent processors (easily attainable with VLSI techmnsmmu 
at a reasonable cost) is possible but is not considered 
optimum because of algorithm complexity and increased syn- 
chronization problems. 

Two independent processors designed with separate tasks 
(list-processing and garbage collection) will mean that the 
list processor will not have to wait while garbage collection 
is being conducted as in regular garbage collection systems. 
There are some immediate advantages to concurrent GC: 

(1) A net speedup in list processing operations. 


(1) An increase in the efficiency of memory usage [RES 
Sape. 











Consequently, it is recommended that concurrent garbage 
collection (with compaction) be implemented in a list- 
processing system with hardware techniques. The least costly 
and simplest method is to have one processor dedicated to 


list-processing and another processor dedicated to GC. 
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